WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 

WO 99/33868 


(51) International Patent Classification ^ 
C07K 14/00 


A2 


(11) International Publication Number: 
(43) International Publication Date: 


8 July 1999 (08.07.99) 


(21) International Application Number: PCT/EP98/08563 

(22) International Filing Date: 18 December 1998 (18.12.98) 


(30) Priority Data: 

9727262.9 


24 December 1 997 (24. 1 2.97) GB 


(71) Applicant {foT all designated States except US): SMITHK- 

LINE BEECHAM BIOLOGICALS S,A. [BE/BE]; Rue de 
rinstitut 89, B-1330 Rixensart (BE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): DALEMANS, Wilfried, 
L., J. [BE/BE]; SmithKline Beccham Biologicals S.A.. 
Rue de rinstitut 89. B-1330 Rixensart (BE). GERARD, 
Catherine. Marie, Ghislaine [BE/BE]; SmithKline Beecham 
Biologicals S.A., Rue de rinstitut 89, B-1330 Rixensart 
(BE). 

(74) Agent: DALTON, Marcus, Jonathan, William; SmithKline 
Beecham, Two New Horizons Court, Brentford, Middlesex 
TW8 9EP (GB). 


(81) Designated States: AL, AM, AT. AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE. ES. FI, GB, GD, 
GE, GH, GM, HR. HU. ID, IL, IN, IS, JP, KE, KG, KP, 
KR. KZ. LC. LK, LR. LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, FT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TR, TT. UA, UG, US, UZ. VN. YU, 
ZW, ARIPO patent (GH. GM, KE, LS. MW, SD. SZ, UG, 
ZW). Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU. TJ, 
TM), European patent (AT. BE, CH. CY. DE, DK, ES, Fl, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent 
(BF, BJ. CF, CG, CI. CM, GA. GN, GW, ML, MR, NE, 
SN. TD, TG). 


Published 

Without international search report and to be republished 
upon receipt of that report. 


(54) TiUe: VACCINE 


(57) Abstract 

-Hie present invenUon provides Human Papilloma Virus (HPV) fusion proteins, linked to an immunological fusion partner that provides 
T helper epiptopes to the HPV antigen. Vaccine formulations are provided that are useful in the treatment or Prophylaxis of HPV mduced 
tumours. 


Serial No. 09/802,445 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets pubHshing intemaUonal applications under the PCT. 


AL 

Albania 

ES 

Spain 

LS 

Lesotho 

SI 

Slovenia 

AM 

Armenia 

FI 

Finland 

LT 

Lithuania 

SK 

Sbvakia 

AT 

Austria 

FR 

France 

LU 

Luxembourg 

SN 

Senegal 

AU 

Australia 

GA 

Gabon 

LV 

Latvia 

sz 

Swaziland 

\Z 

AzcTtnaijan 

GB 

United Kingdom 

MC 

McHiaco 

TD 

Chad 

BA 

Bosnia and Herzegovina 

GE 

Georgia 

MD 

Republic of Moldova 

TO 

Togo 

BB 

Barbados 

GH 

Ghana 

MG 

Madagascar 

TJ 

Tajtlcistan 

BE 

Belgium 

GN 

Guinea 

MK 

The former Yugoslav 

TM 

Turkmenistan 

BF 

Burkina Faso 

GR 

Greece 


Republic of Ma»donia 

TR 

Turkey 

BG 

Bulgaria 

HU 

Hungary 

ML 

Mali 

TT 

UA 

Trinidad and Tobago 

BJ 
BR 

Renin 

IE 

Ireland 

MN 

Mongolia 

Ukraine 

Brazil 

XL 

Israel 

MR 

Mauritania 

UG 

Uganda 

BY 

Belarus 

IS 

Iceland 

MW 

Malawi 

US 

United States of America 

CA 

Canada 

IT 

Italy 

MX 

Mexico 

uz 

Uzbekistan 

CF 

Central African Republic 

JP 

Japan 

NE 

Niger 

VN 

Vict Nam 

CG 

Congo 

KE 

Kenya 

NL 

Netherlands 

YU 

Yugoslavia 

CH 

Switzerland 

KG 

Kyrgyzstan 

NO 

Norway 

ZW 

Zimbabwe 

CI 

Cdcc d' [voire 

KP 

Democratic People's 

NZ 

New Zealand 



CM 

Cameroon 


Republic of Korea 

PL 

Poland 



CN 

China 

KR 

Republic of Korea 

PT 

Portugal 



CU 

Cuba 

KZ 

Kazakstflii 

RO 

Romania 



CZ 

Czech Republic 

LC 

Saint Lucia 

RU 

Russian Federation 



DE 

Germany 

LI 

Liechtenstein 

SD 

Sudan 



DK 

Denmark 

LK 

Sri Lanka 

SF, 

Sweden 



KE 

Estonia 

LR 

Liberia 

sc 

Singapore 





wo 99/33868 PCT/EP98/08563 

VACCINE 

The present invention relates to vaccine compositions, comprising an E6 or/ 
and E7 or E6, E7 fusion protein from an HPV strain optionally linked with an 

5 immunological fusion parmer and formulated with a CpG containing oligonucleotide 
into vaccines that find utility in the treatment or prophylaxis of human papilloma virus 
induced tumours or lesions. In particular, the present invention relates to vaccines 
comprising fusions proteins, comprising a protein or part of a protein that provides T 
helper epitopes (such as protein D from Haemophilus influenzae B) and an antigen 

10 from a human-pap illoma virus (eg comprising an E6 or E7 protein from HPV 16 or 18 
strain associated with cancer) that find utility in the treatment or prophylaxis of 
human papilloma induced tumours, wherein the vaccine is formulated with a CpG 
containing oligonucleotide as an adjuvant. 

Papillomaviruses are small naked DNA tumour viruses (7.9 kilobases, double 

1 5 Strand), which are highly species-specific. Over 70 individual human papillomavirus 
(HPV) genotypes have been described. Papillomaviruses are classified on the basis of 
species of origin (human, bovine etc.) and of the degree of genetic relatedness with 
other papillomaviruses from the same species. HPVs are generally specific for the 
skin or mucosal surfaces and have been broadly classified into "low" and "high" risk 

20 viruses. 

Low risk HPVs usually cause benign lesions (warts or papillomas) that 
persist for several months or years. High risk HPVs are associated with pre-neoplastic 
lesions and cancer. The strongest positive association between an HPV virus and 
human cancer is that which exist between HPV 16 and 18 and cervical carcinoma. 
25 More than ten other HPV types have also been found in cervical carcinomas including 
HPV 3 1 and HPV 33 although at less frequency. 

Genital HPV infection in young sexually active women is common and most 
individuals either clear the infection, or if lesions develop, these regress. Only a 
subset of infected individuals has lesions which progress to high grade intraephithelial 
30 neoplasia and only a fraction of these progress further to invasive carcinoma. 

The molecular events leading to HPV infection have not been clearly 
established. The lack of an adequate in vitro system to propagate human 
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papillomaviruses has hampered the progress to a best informaiion about the viral 
cycle. 

Today, the different types of HPVs have been isolated and characterised with 
the help of cloning systems in bacteria and more recently by PGR amplification. The 
5 molecular organisation of the HP V genomes has been defined on a comparative basis 
with that of the well characterised bovine papillomavirus type 1 (BPV 1). 

Although minor variations do occur, all HPVs genomes described have at least 
seven early genes. El to E7 and two late genes LI and L2. In addition, an upstream 
regulatory region harbors the regulatory sequences which appears to control most 
10 transcriptional events of the HPV genome. 

EI and E2 genes are involved in viral replication and transcriptional control, 
respectively and tend to be disrupted by viral integration. E6 and E7 are involved in 
viral transformation. E5 has also been implicated in this process. 

In the HPVs involved in cervical carcinoma such as HPV 16 and 18, the 
15 oncogenic process starts after integration of viral DNA. The integration results in the 
inactivation of genes coding for the capsid proteins LI and L2 and loss of E2 
repressor function leads to deregulation of the E6/E7 open reading frame installing 
continuously overexpression of the two early proteins E6 and E7 that will lead to 
gradually loss of the normal cellular differentiation and the development of the 
20 carcinoma. E6 and E7 overcome normal cell cycle by inactivating major tumor 
suppressor proteins, p53 and pRB, the retinoblastoma gene product, respectively. 

Carcinoma of the cervix is conmion in women and develops through a pre- 
cancerous intermediate stage to the invasive carcinoma which frequently leads to 
death. The intermediate stages of the disease is known as cervical intraepithelial 
25 neoplasia and is graded I to III in terms of increasing severity ( CIN I-III), 

Clinically, HPV infection of the female anogenital tract manifests as cervical 
flat condylomas, the hallmark of which is the koilocytosis affecting predominantly the 
superficial and intermediate ceils of the cervical squamous epithelium. 

Koilocytes which are the consequence of a cytopathic effect of the virus, 
30 appear as multinucleated cells with a perinuclear clear haloe. The epithelium is 
thickened with abnormal keratinisation responsible for the warty appearance of the 
lesion. 
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Such flat condylomas when positive for the HPV 16 or 18 serotypes, are high- 
risk factors for the evolution toward cervical intraepithelial neoplasia (CIN) and 
carcinoma in situ (CIS) which are themselves regarded as precursor lesions of 
invasive cervix carcinoma. 
5 The natural history of oncogenic HPV infection presents three consecutive 

phases, namely: 

( 1) a latent infection phase, 

(2) a phase of intranuclear viral replication with product of complete virions, which 
corresponds to the occurrence of koilocytes. At this stage, the HPV is producing its 

1 0 full range of proteins including E2, E5, E6, E7, L 1 and L2. 

(3) a phase of viral integration into the cellular genome, which triggers the onset of 
malignant transformation, and corresponds to CIN II and CIN Ill/CIS with 
progressive disappearance of koilocytes. At this stage, the expression of E2 is down- 
regulated, the expression of E6 and E7 is enhanced. Between CIN II/III and CIN III / 

15 Cervix carcinoma the viral DNA changes from being episomal in the basal cells to 
integration of E6 and E7 genes only (tumoral cells). 85% of all cervix carcinomas are 
squamous cell carcinpmas most predominantly related to the HPV 16 serotype. 10% 
and 5% are adenocarcinomas and adenosquamous cell carcinomas respectively, and 
both types are predominantly related to HPV 18 serotype. Nevertheless other 

20 oncogenic HPV's exist 

International Patent Application No. WO 96/19496 discloses variants of 
human papilloma virus E6 and E7 proteins, particularly fusion proteins of E6/E7 with 
a deletion in both the E6 and E7 proteins. These deletion fusion proteins are said to 
be immunogenic. 

25 Immunomodulatory oligonucleotides contain unmethylated CpG dinucleotides 

("CpG") and are known (WO 96/02555, EP 468520). CpG is an abbreviation for 
cytosine-guanosine dinucleotide motifs present in DNA. Historically, it was observed 
that the DNA fraction of BCG could exert an anti-tumour effect. In fiirther studies, 
synthetic oligonucleotides derived from BCG gene sequences were shown to be 

30 capable of inducing immunostimulatory effects (both in vitro and in vivo). The 
authors of these studies concluded that certain palindromic sequences, including a 
central CG motif, carried this activity. The central role of the CG motif in 
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immunostimuiation was later elucidated in a publication by Krieg, Nature 374, p546 
1995. Detailed analysis has shown that the CG motif has to be in a certain sequence 
context, and that such sequences are common in bacterial DNA but are rare in 
vertebrate DNA. 

5 It is currently believed that this evolutionary difference allows the vertebrate 

immime system to detect the presence of bacterial DNA (as occurring during an 
infection) leading consequently to the stimulation of the immune system. The 
immimostimulatory sequence as defined by Krieg is: 

Purine Purine CG pyrimidine pyrimidine and where the CG motif is not 

10 methylated. In certain combinations of the six nucleotides a palindromic sequence is 
present. Several of these motifs, either as repeats of one motif or a combination of 
different motifs, can be present in the same oligonucleotide. The presence of one or 
more of these immunostimulatory sequence containing oligonucleotides can activate 
various immune subsets, including natural killer cells (which produce interferon y and 

15 have cytolytic activity) and macrophages (Wooldrige et al Vol 89 (no. 8), 1977). 
Although other unmethylated CpO containing sequences not having this consensus 
sequence have now been shown to be immunomodulatory. 

The present invention provides compositions comprising either an E6 or/and 
E7 or an E6/E7 fusion protein optionally linked to an immunological fusion partner 

20 having T cell epitopes, and adjuvanted with an immunomodulatory CpG containing 
oligonucleotide. 

In a preferred form of the invention, the immunological fusion parmer is 
derived from protein D of Heamophilus influenza B. Preferably the protein D 
derivative comprises approximately the first 1/3 of the protein, in particular 

25 approximately the first N-terminal 100-1 10 amino acids. The protein D may be 
lipidated (Lipo Protein D). Other immunological fusion partners include the non- 
structural protein from influenzae virus, NSl (hemagglutinin). Typically the N 
terminal 81 amino acids are utilised, although different fragments may be used 
provided they include T-helper epitopes. 

30 In another embodiment the immunological fusion partner is the protein known 

as LYTA. Preferably the C terminal portion of the molecule is used. Lyta is derived 
from Streptococcus pneumoniae which synthesize an N-acetyl-L-alanine amidase. 
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amidase LYTA, (coded by the lytA gen {Gene. 43 (1986) page 265-272} an autolysin 
that specifically degrades certain bonds in the peptidoglycan backbone. The C- 
terminal domain of the LYTA protein is responsible for the affinity to the choline or 
to some choline analogues such as DEAE. This propert>' has been exploited for the 
5 development of Exoli C-LYTA expressing piasmids useful for expression of fusion 
proteins. Purification of hybrid proteins containing the C-LYTA fragment at its 
amino terminus has been described {Biotechnology: 10. (1992) page 795-798}. As 
used herein a preferred embodiment utilises the repeat portion of the Lyta molecule 
found in the C terminal end starting at residue 178. A particularly preferred form 
10 incorporates residues 1 88 - 305. 

Accordingly, the present invention in preferred embodiment provides 
compositions comprising an immunomodulatory CpG oligonucleotide and a fusion 
proteins comprising Protein D - E6 from HPV 16, Protein D - E7 from HPV 16 
Protein D - E7 from HPV 18, Protein D - E6 from HPV 18, and Protein D E6 E7 from 
15 both HPV 16 and 18. The protein D part preferably comprises the first 1/3 of protein 
D. It will be appreciated that other E6 and E7 proteins may be utilised from other 
HPV subtypes. 

The proteins utilised in the present invention preferably are expressed in E. 
coli. In a preferred embodiment the proteins are expressed with a Histidine tail 
20 comprising between 5 to 9 and preferably six Histidine residues. These are 
advantageous in aiding purification. 

The protein E7 may in a preferred embodiment carry one or several 
mutations in the binding site for the rb (retinoblastoma gene product) and hence 
eliminate any potential transforming capacity. Preferred mutations for HPV 16 E7 
25 involve replacing Cys34 with Glycine, or Glutamic acid,^ with Glutamine. In a 
preferred embodiment the E7 protein contains both these mutations. 

Preferred mutations for the HPV 18 E, involve replacing Cys., with Glycine 
and/or Glutamic acid.^with Glutamine. Again preferably both mutations are present. 
Single or double mutations may also be inu*oduced p53 region of E^ to 
30 eliminate any potential transforming ability. 
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In a further embodiment of the invention there is provided and E6 E7 fusion 
protein from HPV linked to an immunological fusion partner and a CpG 
immunomodulatory oligonucleotide. 

The vaccine of the present invention preferentially induce a THl immune 
5 response. 

Two main types of Helper T cells have been characterized THl and TH2, 
which differ in the type of cytokines they secrete. These cytokines can be considered 
as the driving force behind the development of 2 different types of immune response : 
THl -type of immune response is associated with ceil mediated effector mechanisms 
10 such as production of the INF-y and IL-2 cytokines by T-lymphocytes. INF-y which in 
turn can activate other cells and induce them to secrete other important cytokines 
and mediators (INF-y - activated NK cells produce IL12, IL2-activated NK cells are 
transformed into lymphokine activated killer cell ( LAK), INF-y-activated 
macrophages secrete inflamatory mediators like TNFa, IL I, IL 6 and release nitric 
15 oxyde , IL2 can provide help for the differentiation of antigen specific, haplotype 
restricted cytotoxic T lymphocytes (CTL). At the antibody level , in mice, Thl-type 
of immune response is also associated with the generation of antibodies of the IgG2 
isotype (IgG2a in Balb/c mice and IgG2b in C57BL/6 mice) . 

The Th2-type of immune response is associated with a humoral immune 
20 response to the antigen, with the production of cytokines like IL4, IL5, IL6, IL 1 0 and 
by the generation of a broad range of immunoglobulin isotypes including in mice 
IgGUlgA, and IgM. 

In man the distinction of Thl and Th2-type immune responses is not absolute. 
An individual will support an immune response which is predominantly Thl or 
25 predominantly Th2. However, it is often convenient to consider the families of 
cytokines in terms of that described in murine CD4 +ve T cell clones by Mosmann 
and Coffman {Mosmann, T.R. and Coffman, R.L (1989) THl and TH2 cells: different 
patterns of lymphokine secretion lead to different functional properties. Annual 
Review of Immunology, 7. pi 45-1 73). 
30 In the human THl type of response is also associated with the presence of 

cytokine (IFNg and IL2) evenmaliy with the presence of CTl and IgG2 isotypes in 
mice correspond to IgGl type antibodies 
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This type I phenotype is of particular importance in protecting against viral 
and intracellular bacterial infections as well as in the treatment of cancer. 

To manufacture the proteins used in the invention by recombinant techniques, 
an expression strategy can be used which involves fusion of E7, E6 or E6/E7 fusion to 
5 the 1/3-N-terminal portion of protein D from Haemophilus influenzae B, an 
immunological fusion panner providing T cell helper epitopes. An affinity 
poiyhistidine tail is engineered at the carboxy terminus of the fusion protein allowing 
for simplified purification. Such recombinant antigen is overexpressed in E. coli as 
insoluble protein. 

1 0 The proteins of the invention my be coexpressed with thioredoxin in trans 

(TIT). Coexpression of thioredoxin in trans versus in cis is preferred to keep antigen 
free of thioredoxin without the need for protease. Thioredoxin coexpression eases the 
solubilisation of the proteins of the invention. Thioredoxin coexpression has also a 
significant impact on protein purification yield, on purified-protein solubility and 

15 quality. 

The replicable expression vectors may be prepared in accordance with the 
invention, by cleaving a vector compatible with the host cell to provide a linear DNA 
segment having an intact replicon, and combining said linear segment with one or 
more DNA molecules which, together with said linear segment encode the desired 
20 product, such as the DNA polymer encoding the protein of the invention, or derivative 
thereof, under Ugating conditions. 

Thus, the DNA polymer may be preformed or formed during the construction 

of the vector, as desired. 

The choice of vector will be determined in part by the host cell, which may be 

25 prokaryotic or eukaryotic but preferably is E. coli. Suitable vectors include plasmids, 
bacteriophages, cosmids and recombinant viruses. 

The preparation of the replicable expression vector may be carried out 
conventionally with appropriate en2ymes for restriction, polymerisation and ligation 
of the DNA, by procedures described in, for example, Maniatis et al. cited above. 

30 The recombinant host cell is prepared, in accordance with the invention, by 

transforming a host cell with a replicable expression vector of the invention under 
transforming conditions. Suitable transforming conditions are conventional and are 
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described in, for example, Maniaiis et ai cited above, or ''DNA Cloning*' Vol. II, 
D.M. Glover ed., IRL Press Ltd, 1985. 

The choice of transforming conditions is determined by the host cell. Thus, a 
bacterial host such as E. coli may be treated with a solution of CaCl, (Cohen et ai, 

5 Proc. Nat. Acad. Sci., 1973, (5P, 21 10) or with a solution comprising a mixture of 
RbCl, MnCK, potassium acetate and glycerol, and then with 3-[N-morpholino]- 
propane-suiphonic acid, RbCl and glycerol. Mammalian ceils in cuhure may be 
transformed by calcium co-precipitation of the vector DNA onto the cells. The 
invention also extends to a host cell transfora:ied with a replicable expression vector of 

10 the invention. 

Culturing the transformed host cell under conditions permitting expression of 
the DNA polymer is carried out conventionally, as described in, for example, Maniatis 
et ai and "DNA Cloning" cited above. Thus, preferably the cell is supplied with 
nutrient and cultured at a temperature below 50°C. 
15 The product is recovered by conventional methods according to the host cell. 

Thus, where the host cell is bacterial, such as E. coli it may be lysed physically, 
chemically or enzymaticaliy and the protein product isolated from the resulting lysate. 
Where the host cell is manmialian, the product may generally be isolated from the 
nutrient medium or from cell free extracts. Conventional protein isolation techniques 
20 include selective precipitation, adsorption chromatography, and affmity 
chromatography including a monoclonal antibody affmity column. 

When the proteins of the present invention are expressed with a hisitidine tail 
(His tag). The proteins can easily be purified by affinity chromatography using an ion 
metal affmity chromatography column (I MAC) column. 
25 A second chromatographic step, such as Q-sepharose may be utilised either 

before or after the IMAC column to yield highly purified protein. If the 
immunological fusion partner is C-LYTA. then it is possible to exploit the affinity of 
CLYTA for choline and/or DEAE to punfy this product. Products containing both 
C-LYTA and his tags can be easily and efficiently purified in a two step process 
30 involving differential affinity chromatography. One step involves the affinity of the 
His tag to IMAC columns, the other involves the affinity of the C-terminal domain of 
LYTA for choline or DEAE. 
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A preferred vaccine composition comprises at least Protein D - E6 from HPV 
16 or derivative thereof together with Protein D - E7 from HPV 16. Alternatively the 
E6 and E7 may be presented in a single molecule, preferably a Protein D E6/E7 
fiision. Such vaccine may optionally contain either or both E6 and E7 proteins from 
5 HPV 1 8, preferably in the form of a Protein D - E6 or Protein D - E7 fusion protein 
or Protein D E6/E7 fusion protein. The vaccines of the present invention may contain 
other HPV antigens from HPV 16 or 18. In particular, the vaccine may contain LI or 
L2 antigen monomers. Alternatively such LI or L2 antigens may be presented 
together as a virus like particle or the LI alone protein may be presented as virus like 
10 particle or caposmer structure. Such antigens, virus like panicles and capsomer are 
per se known. See for example WO94/00152, WO94/20137, WO94/05792, and 
WO93/02184. Additional early proteins may be included such as E2 or preferably E5 
for example The vaccine of the present invention may additionally comprise antigens 
from other HPV strains, preferably from strains HPV 6, 1 1, 31 or 33. 
1 5 Vaccine preparation is generally described in Vaccine Design - The subunit 

and adjuvant approach (Ed. Powell and Newman) Pharmaceutical Biotechnology Vol. 
6 Plenum Press 1995. Encapsulation within liposomes is described by Fullerton, US 
Patent 4,235,877. 

The preferrred oligonucleotides preferably contain two or more CpG motifs 
20 separated by six or more nucleotides. The oligonucleotides of the present invention 
are typically deoxynucleotides. in a preferred embodiment the intemucleotide in the 
oligonucleotide is phosphorodithioate, or more preferably a phosphorothioate bond, 
although phosphodiester and other intemucleotide bonds are within the scope of the 
invention including oligonucleotides with mixed intemucleotide linkages. 
25 Preferred oligonucleotides have the following sequences: The sequences 

preferably contain all phosphorothioate modified intemucleotide linkages. 
OLIGO 1 : TCC ATG ACG TTC CTG ACG TT 
OLIGO 2: TCT CCC AGC GTG CGC CAT 

OLIGO 3: ACC GAT GAC GTC GCC GGT GAG GGC ACG ACG 
30 The CpG oligonucleotides utilised in the present invention may be synthesized 

by any method known in the art (eg EP 468520). Convemently, such oligonucleotides 
may be synthesized utilising an automated synthesizer. Methods for producing 
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phosphorothioate oligonucleotides or pliosphorodithioate are described in 
US5,666,153, US5,278,302 and WO95/26204. 

The invention will be further described by reference to the following 
examples: 

5 EXAMPLE I: Construction of an E. coli strain expressing fusion Protein-Dl/3 - 
E7 -His (HPV16) 

1) - Construction of expression plasmid 

a) - Plasmid pMG MCS prot Dl/3 (- pRIT14589) is a derivative of pMG81 
(described in UK patent application n° 951 3261.9 published as WO97/01640) in 

10 which the codons 4-8 1 of NSl coding region from Influenza were replaced by the 
codons corresponding to residues Ser 20 -> Thrl27 of mature protein D of 
Haemophilus Influenzae strain 772, biotype 2 (H. Janson et al., 1991. Infection and 
Immunity, Jan. p.l 19-125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His). 

1 5 This plasmid is used to express the fusion protein D l/3-E7-His. 

b) - HPV genomic E6 and E7 sequences type HPV 16 (See Dorf et al. Virology 
1985, 145, p. 181-185) were amplified from HPV 16 full length genome cloned in 
pBR322 (obtained from Deutsches Krebsforschungszentrum (DKFZ), 

* Referenzzentrum fur human pathogen Papillomaviruses - D 69120 - Heidelberg) and 
20 were subcloned into pUC19 to give TCA 301 (= pRIT14462). 

Construction of plasmid TCA 308 (= pRIT1450l): a plasmid expressing the 
fusion Protein-Dl/3-E7-His 

The nucleotides sequences corresponding to amino acids 1 -> 98 of E7 
protein are amplified from pRIT14462. During the polymerase chain reaction, Ncol 
25 and Spel restriction sites were generated at the 5' and 3' ends of the E7 sequences 

allowing insertion into the same sites of plasmid pMGMCS Prot Dl/3 to give plasmid 
TCA308 (= pRITt4501). The insert was sequenced to verify that no modification had 
been generated during the polymerase chain reaction. The sequence for the fusion 
protein-Dl/3-E7-His (HPV 16) is described in sequence ID No. I and the coding 
30 sequence in ID No. 2. 

2) - Transformation of AR58 strain 
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Plasmid pRIT14501 was introduced into E. coU AR58 (Men ei ai. 1985, Proc. 
Natl. Acad. Sci., 82:88) a defective X lysogen containing a thennosensitive repressor 
of the A. pL promoter. 

3) - Growth and induction of bacterial strain - Expression of Prot -Dl/3-E7-His 

5 Ceils of AR58 transformed with plasmid pRIT14501 were grown in 100 ml of 

LB medium supplemented with 50 |igr/ml of Kanamycin at SC^C. During the 
logarithmic phase of growth bacteria were shifted to 39°C to inactivate the X repressor 
and turn on the synthesis of protein Dl/3-E7-His. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20°C. 

10 EXAMPLE 11: Construction of an E.coli strain expressing fusion Protein-Dl/3- 
E6-his / HPV16 

1. Construction of expression plasmid 

a) Plasmid pMG MCS protDl/3 (= pRIT14589) is a derivative of pMGSl (described 
in WO97/01640 in which the codons 4-81 of NSl coding region from Influenza were 

15 replaced by the codons corresponding to residues Ser 20 ^ Thr 127 of mauire protein 
D of Haemophilus Influenzae strain 772. biotype 2 (H. Janson et a/., 1991 , Infection 
and Immunity, Jan. p.ll9-125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His). 
This plasmid is used to express the fusion protein Dl/3-E6-his. 

20 h ) HPV genomic E6 and E7 sequences type HPVj6 (Seedorf et ai , Virology 1985, 
145, P.181-1S5) were amplified from HPV16 ftill length genome cloned in pBR322 
(obtained from Deutsches Krebsforschungszentrum (DKFZ), Referenzzentrum fUr 
human pathogen Papillomaviruses - 

c) D 69120 - Heidelberg), and were subcloned into pUC19 to give TCA 301 (= 
25 pRITl4462). 

Construction of plasmid TCA 307 (=pRIT14497) : a plasmid expressing the 
fusion Protein.Dl/3-E6-His /HPV16 

The nucleotides sequences corresponding to amino acid. 
1 -> 151 of E6 protein were amplified from pRIT 14462. Durmg the 
30 polymerase chain reaction. Ncol and Spel restriction sites were generated at the 5' an 
3' ends of the E6 sequences allowing insertion into the same sites of plasmid 
pMGMCS Prot Dl/3 to give plasmid TCA307 (= pRIT14497). The msert was 
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sequenced to verify that no modification had been generated during the polymerase 
chain reaction. The protein and coding sequence for the fusion protein-Dl/3-E6-His 
is described in sequence ID No. 3 and 4. 

2. Transformation of AR58 strain 

5 Plasmid pRIT14497 was introduced into E. coli AR58 (Mott et ai., 1985, Proc. 

Natl. Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor 
of the A, pL promoter. 

3. Growth and induction of bacterial strain - Expression of Prot-Dl/3-E6-His 

Cells of AR58 transformed with plasmid pRlT 14497 were grown in 100 ml of 
10 LB medium supplemented with 50 ^igr/ml of Kanamycin at 30'C. During the 
logarithmic phase of growth bacteria were shifted to 39°C to inactivate the X repressor 
and turn on the synthesis of protein Dl/3-E6-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20C, 

4. Characterization of fusion Protein Dl/3-E6-his (HPV 16) 
15 Preparation of extracts 

Frozen cells are thawed and resuspended in 10 ml of PBS buffer. Cells are 
broken in a French pressure cell press SLM Anainco at 20.000 psi (three passages). 
The extract is centrifuged at 16.000 g for 30 minutes at 4*'C. 
Analysis on Cooniassie-stained SDS-polyacrylaraide gels and Western blots 
20 After centrifugation of extracts described above, aliquots of supernatant and 

pellet were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting. 

A major band of about 32 kJDa, localized in the pellet fraction, was visualised 
by Coomassie stained gels and identified in Western blots by rabbit polyclonal anti- 
25 protein-D and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
(Qiagen cat. n° 345 10) which detects accessible histidine tail. The level of expression 
represents about 5 % of total protein. 
5, Coexpression with thioredoxin 

In an analagons fashion to the expression of prot D 1/3 E7 His from HPV 18 
30 (example IX) an E.coli strain AR58 was transformed with a plasmid encoding 
thioredoxin and protein D 1/3 E7 His (HPV 16). 
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10 


EXAMPLE III: Construction of an E. coli strain expressing fusion Protein-Dl/S- 
E6E7-his / HPV16 
1. Construction of expression plasmid 

a) Plasmid pMG MCS protDl/3 (= pRIT14589) is a derivative of pMG81 (described 
Supra) in which the codons 4-81 of NSl coding region from Influenza were replaced 
by the codons corresponding to residues Ser 20 Thr 127 of mature protein D of 
Haemophilus Influenzae strain 772, bioiype 2 (H. Janson et ai, 199h Infection and 
Immunity, Jan. p.l 19-125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His). 
This plasmid is used to express the fusion protein Dl/3-E6E7-his. 

b) HPV genomic E6 and E7 sequences type HPV16 (Seedorf et a/.. Virology 1985, 
145, p.l 8 1-185) were amplified from HPV16 ftili length genome cloned in pBR322 
(obtained from Deutsches Krebsforschungszentrum (DKFZ), Referenzzentrum fur 
human pathogen Papillomaviruses - D 69120 - Heidelberg) and were subcloned into 

15 pUC19 to give TCA 301 (= pRlTl4462). 

c) The coding sequences for E6 and E7 in TCA301 (= pRlT 

14462) were modified with a synthetic oligonucleotides adaptor (inserted between Afl 
III and Nsi I sites) introducing a deletion of 5 nucleotides between E6 and E7 genes 
to remove the stop codon of E6 and create fused E6 and E7 coding sequences in the 
20 plasmid TCA309( = pRIT 14556 ). 

Construction of plasmid TCA 3n(- pRIT14512) : a plasmid expressing the 
fusion Protein-Dl/3-E6E7-His /HPV16 

The nucleotides sequences corresponding to amino acids 1 -> 249 of fused 
E6E7 protein were amplified from pRIT14556. During the polymerase chain 
25 reaction, Ncol and Spel restriction sites were generated at the 5 ' and 3 ' ends of the 
E6E7 aised sequences allowing insertion into the same sites of plasmid pMGMCS 
ProtDl/3 to give plasmid TCA3 11 (= pRIT145l2). The insert was sequenced to 
verify that no modification had been generated during the polymerase chain reaction. 
The protein and coding sequence for the fusion protein-D E6/E7 1/3-His is descnbed 
30 sequence ID No. 5 and 6. 

2. Transformation of AR58 strain 
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Piasmid pRIT14512 was introduced into E. coli AR58 (Mon ei al., 1985, Proc. 
Natl. Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor 
of the X pL promoter. 

3. Growth and induction of bacterial strain - Expression of Prot-Dl/3-E6E7-His 

5 Cells of AR58 transformed with piasmid pRIT14512 were grown in 100 mi of 

LB medium supplemented with 50 ngr/ml of Kanamycin at BO^'C. During the 
logarithmic phase of growth bacteria were shifted to 39°C to inactivate the \ repressor 
and turn on the synthesis of protein Dl/3-E6E7-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20C. 

10 4. Characterization of fusion Protein Dl/3-E6E7-his 

Frozen cells are thawed and resuspended in 10 ml of PBS buffer. Cells are 
broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract is centriftiged at 16.000 g for 30 minutes at 4*'C. 

After centrifligation of extracts described above, aiiquots of supernatant and 

15 pellet were analysed by SDS-poiyacrylamide gel electrophoresis and Western 
blotting, 

A major band of about 48 kDa, localized in the pellet fraction, was visualised 
by Coomassie stained gels and identified in Western blots by rabbit polyclonal anti- 
protein-D and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
20 (Qiagen cat. n° 345 1 0) which detects accessible histidine tail. The level of expression 
represents about 1 % of total protein. 
EXAMPLE: IV 

In an analagous fashion the fusion protein of Lipo D 1/3 and E6-E7 from 
HPV16 was expressed in Exoli in the presence of thioredoxin. 

25 The N-terminal of the prc-protein (388 aa) contains MDP residues followed by 16 
amino acids of signal peptide of lipoprotein D (from Haemophilus Influenzae) which 
is cleaved in vivo to give the mature protein (370 aa). Lipoprotein portion (aa 1 to 
127) is followed by the proteins E6 and E7 in fusion. The C terminal of the protein is 
elongated by TSGHHHHHH. 

30 EXAMPLE V: Construction of E.coli strain B1002 expressing fusion ProtDl/3- 
E7 

Mutated (cys24->gly,glu26->gln ) type HPV16 
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l)-Construction of expression plasmid 
Starting material: 

a) - Plasmid pRIT 14501 ( = TCA 308) which codes for fusion ProtDl/3-E7 -His 

b) - Plasmid LITMUS 28 ( New England Biolabs cat n° 306-28 ) , a cloning vector 
pUC-derived 

c) - Plasmid pMG MCS ProtDi/3 ( pRlT 14589) , a derivative of pMG81 (described 
Supra) in which the codons 4-8 1 of NS I coding region from Influenza were replaced 
by the codons corresponding to residues Ser 20 Thr 127 of mature protein D of 
Haemophilus Influenzae strain 772, bioiype 2 (H. Janson et ai, 1991, Infection and 
Immunity, Jan. p. 1 19-125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (11 residues) and a coding region for a C-terminal histidine tail (6 His) 
Construction of plasmid pRIT 14733( ='TCA347): a plasmid expressing the 
fusion Protein-Dl/3-E7 mutated ( cys24->gly ,glu26->gln) with His tail 

The Ncol - Xbal fragment from pkIT 14501 (-TCA 308 ), bearing the coding 
sequence of E7 gene from HPV16 , elongated with an His tail , was subcioned in an 
intermediate vector Litmus 28 useful for mutagenesis to give pRIT 14909 (=TCA337) 
Double mutations cys24->gly ( Edmonds and Vousden , J.Virology 63 : 2650 (1989) 
and glu26->gln ( Phelps et al , J.Virology 66: 2418-27 ( 1992 ) were chosen to 
impair the binding to the antioncogene product of Retinoblastome gene ( pRB ). 
20 The introduction of mutations in E7 gene was realized with the kit " Quick Change 
Site directed Mutagenesis ( Stratagene cat n° 200518) to give plasmid pRIT 
14681(=TCA343 ) .After verification of presence of mutations and integnty of the 
complete E7 gene by sequencing , the mutated E7 gene was introduced into vector 
pRIT 14589 ( = pMG MCS ProlDl/3 ) to give plasmid pRIT 14733 (=TCA347) 
25 protein and coding sequence. 

The sequence for the fusion protein-Dl/3-E 7 mutated ( cys24->gly, glu26- 
>gln ) -His is described in sequence ID No. 7 and 8. 

2)-Construction of strain B1002 expressing ProtDl/3-E7mutated (cys 24->gly , 
glu26-->glii )-His /HPV16 

30 Plasmid pRlT 14733 was introduced into ExoH AR58 ( Mott et al. .1985, 
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Proc. Natl. Acad, Sci. , 82:88 ) a defective k lysogen containing a thermosensitive 
repressor of the X pL promoter ,to give strain B1002 , by selection for transformants 
resistant to kanamycine 

3) -Growth and induction of bacterial strain B1002 - Expression of ProtDl/3-E7 
5 mutated (cys 24->gly , glu26->gln )-His /HPV16 

Cells of AR58 transformed with plasmid pRJT 14733 ( B1002 strain ) were 
grown at 30°C in 100 ml of LB medium supplemented with 50 \xgT /ml of Kanamycin. 
During the logarithmic phase of growth bacteria were shifted to 39°C to inactivate the 
X repressor and turn on the synthesis of ProtDl/3-E7 mutated -His /HPV16 . The 
10 incubation at 39°C was continued for 4 hours . Bacteria were pelleted and stored at - 
20X- 

4) -Characterization of fusion ProtDl/3-E7 mut (cys24->gly, glu26->gln)- His type 
HPV16. 

Frozen cells were thawed and resuspended in 10 ml of PBS buffer.Cells were 
15 broken in a French Pressure cell press SLM Aminco at 20 000 psi ( three passages) . 
The extract was centrifuged at 16000 g for 30 minutes at 4''C. 

After centrifiigation of extracts described above, aliquots of supernatant and 
pellet were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting. 

20 A major band of about 33 kDa, localized in the pellet fraction, was visualised 

by Coomassie stained gels and identified in Western blots by rabbit polyclonal 22 J 70 
anti-protein D, by monoclonal anti E7 /HPV16 from Zymed and by Ni-NTA 
conjugate coupled to calf intestinal alkaline phosphatase (Qiagen cat. n** 34510) which 
detects accessible histidine tail. The level of expression represents about 3 to 5 % of 

25 total protein. 

Cells of B 1002 were separated from the culture broth by centrifugation. 
The concentrated cells of B 1002 were stored at -65°C. 

EXAMPLE VI: Construction of an £. co// strain expressing fusion ciyta-E6- 
his (HPV 16) 
30 1. Construction of expression plasmid 

a) -Plasmid pRJT 14497 (= TCA307), that codes for fusion ProtDl/3-E6-His /HPV 16 
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b)-Plasmid pRITl466l (= DVA2), an intermediate vector containing the coding 
sequence for the 1 17 C-terminai codons of LytA of Streptococcus Pneumoniae. Lyta 
is derived from Streptococcus pneumoniae which synthesize an N-acetyl-L-alanine 
amidase, amidase LYTA. (coded by the lytA gene {Gene, 43 (1986) pag 265-272} an 
5 autolysin that specifically degrades certain bonds in the peptidoglycan backbone . 
The C-terminal domain of the LYTA protein is responsible for the affinity to the 
choline or to some choline analogues such as DEAE. 

Lb Construction of plasmid pRIT14634 (=TCA332): a plasmid expressing the 
fusion clyta-E6-His /HPV16 

10 a)The first step v^^as the purification of the large Ncol-Aflll restriction fragment from 
plasmid pRIT 14497 and the purification of the small Aflll-Afllll restriction fragment 
from pRIT 14661 

b)The second step was linking of clyta sequences to the E7-His sequences (Ncol and 
Afllll are compatible restriction sites) that gave rise to the plasmid pRJT 14634 
15 (=TCA332), coding for the fusion protein clyta-E6-His under the control of the pL 
promoter. 

The protein and coding sequence for the fusion protein clyta-E6-His is described 
sequence ID No. 9 and 10. 
Transformation of AR58 strain 
20 Plasmid pRITl4634 was introduced into E. coli AR58 (Mott et al., 1985, Proc. 

Natl. Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor 
of the X pL promoter. 

Growth and induction of bacterial strain - Expression of clyta-E6-His 

Cells of AR58 transformed with plasmid pRIT 14634 were grown in 100 ml of 
25 LB medium supplemented with 50 ^g^/ml of Kanamycin at 30°C. During the 

logarithmic phase of growth bacteria were shifted to 39°C to inactivate the X repressor 
and turn on the synthesis of protein clyta-E6-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20°C. 
4. Characterization of fusion clyta-E6-his 
30 Frozen cells were thawed and resuspended in 10 ml of PBS buffer. Cells were 

broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract was centrifuged at 16.000 g for 30 minutes at 4°C. After cenirifugation of 
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extracts described above, aliquots of supernatant and pellet were analysed by SDS- 
polyacrylamide gel electrophoresis and Western blotting. 
A major band of about 33 kDa, localized in the pellet fraction, was visualised by 
Coomassie stained gels and identified in Western blots by rabbit polyclonal anti-clyta 
5 antibodies and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
(Qiagen cat. n° 34510) which detects accessible histidine tail. The level of expression 
represents about 3 % of total protein. 

EXAMPLE VII: Construction of an E. coli strain expressing fusion clyta-E7-his 
(HPV 16) 

10 1 . Construction of expression plasmid 
l.a Starting materials 

a) -Plasmid pRIT14501 (= TCA308), that codes for fusion ProtDl/3-E7-His /HPV16 

b) -Plasmid pRITl4661 (= DVA2), an intermediate vector containing the coding 
sequence for the 1 17 C-ierminai codons of LytA of Streptococcus Pneumoniae, 

15 l.b Construction of plasmid pRITK626 (^TCA330): a plasmid expressing the 
fusion ciyta-E7-His / HPV16 

a) The first step was the purification of the large Ncol-Aflll restriction fragment firom 
plasmid pRIT 14501 and the purification of the small Aflll-Afllll restriction firagment 
from pRlT 14661 

20 b) The second step was linking of clyta sequences to the E7-His sequences (Ncol and 
Afllll are compatible restriction sites) that gave rise to the plasmid pRIT 14626 
(=TCA330), coding for the fusion protein clyta-E7-His under the control of the pL 


25 decribed in sequence ID No. 11 and 12. 
2, Transformation of AR58 strain 

Plasmid pRIT14626 was introduced into E. coli AR58 (Mott et al., 1985, Proc. 
Natl. Acad. Sci., 82:88) a defective a lysogen containing a thermosensitive repressor 
of the K pL promoter. 
50 3. Growth and induction of bacterial strain - Expression of clyta-E7-His 

Cells of AR58 u-ansfonned with plasmid pRIT 14626 were grown in 100 ml of 
LB medium supplemented with 50 ngr/ml of Kanamycin at 30°C. During the 


promoter. 


The protein and coding sequence for the fusion protein clyta- E7-His is 
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logarithmic phase of growth bacteria were shifted to 39°C to inactivate the k repressor 
and turn on the synthesis of protein clyta-E7-his. The mcubation at 39''C was 
continued for 4 hours. Bacteria were pelleted and stored at -20°C. 
4. Characterization of fusion clyta-E7-his 

5 Frozen cells were thawed and resuspended in 10 ml of PBS buffer. Cells were 

broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract was centrifuged at 16.000 g for 30 minutes at 4°C. After centrifugation of 
extracts described above, aliquots of supernatant and pellet were analysed by SDS- 
polyacrylamide gel electrophoresis and Western blotting. 

10 A major band of about 35 kDa, localized in the pellet fraction, was visualised by 
Coomassie stained gels and identified in Western blots by rabbit polyclonal anti-clyta 
antibodies and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
(Qiagen cat. n'' 34510) which detects accessible histidine tail. The level of expression 
represents about 5 % of total protein. 

1 5 EXAMPLE VIII: Construction of an £. coii strain expressing fusion clyta-E6E7- 
his (HPV 16) 

1. Construction of expression plasmid 
l.a Starting materials 

a) -Plasmid pRIT14512 (= TCA3U), that codes for fusion ProtDl/3-E6E7-His 
20 /HPV 16 

b) -Plasmid pRlT14661 (= DVA2), an intermediate vector containing the coding 
sequence for the 1 17 C -terminal codons of LytA of Streptococcus Pneumoniae. 

l.b Construction of plasmid pRIT14629 (=TCA331): a plasmid expressing the 
fusion clyta-E6E7-His /HP VI 6 
25 a)The first step was the purification of the large Ncol-Aflll restriction fragment from 
plasmid pRlT14512 and the purification of the small Aflll-Anill restriction fragment 
from pRIT14661 

b)The second step was linking of clyta sequences to the E7-His sequences (Ncol and 
Afllll are compatible restriction sites)that gave rise to the plasmid pRlT 14629 
30 ('^TCASSl), coding for the fusion protein clyta-E6E7-His under the control of the pL 
promoter. 
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The protein and coding sequence for the fusion protein clyta-E6E7-His is 
sequenced ID No. 13 and 14. 

2. Transformation of AR58 strain 

Plasmid pRITl4629 was introduced into E. coli AR58 (Mott et al., 1985. Proc. 
5 Natl. Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor 
of the X pL promoter. 

3. Growth and induction of bacterial strain - Expression of clyta-E6E7-His 

Cells of AR58 transformed with plasmid pRlT 14629 were grown in 100 ml of 
LB medium supplemented with 50 |igr/ml of Kanamycin at 30'C. During the 
10 logarithmic phase of growth bacteria were shifted to 39*'C to inactivate the X repressor 
and turn on the synthesis of protein clyta-E6E7-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20*'C. 

4. Characterization effusion ciyta-E6E7-his 

Frozen cells were thawed and resuspended in 10 ml of PBS buffer. Cells were 
15 broken in a French pressure cell press SllM Aminco at 20.000 psi (three passages). 
The extract was centrifuged at 16.000 g for 30 minutes at 4'*C. 

After centrifugation of extracts described above, aliquots of supernatant and 
pellet were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting. 

20 A major band of about 48 kDa, localized in the pellet fraction, was visualised 

by Coomassie stained gels and identified in Western blots by rabbit polyclonal anti- 
ciyta antibodies and by Ni-NTA conjugate coupled to calf intestinal alkaline 
phosphatase (Qiagen cat. n° 34510) which detects accessible histidine tail. The level 
of expression represents about I % of total protein. 
25 EXAMPLE IX: Prot Dl/3 E7 his (HPV 18) (E.CoU BlOll) 

Protein Dl/3 E7 his HPV expressed with Thiorcdoxin inTrans (E.CoU B1012) 
1) - Construction of expression plasmids 

l).a,Construction of plasmid TCA316(=pRIT 14532> a plasmid 
expressing the fusion Protein-Dl/3-E7-His /HP VI 8 
30 Starting materials 

a) - Plasmid pMG MCS prot Dl/3 (= pRIT14589) is a derivative of pMG81 
(described in UK patent application n° 951 3261 .9 published as WO97/01640 in 
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which the codons 4-81 of NSl coding region from influenza were replaced by the 
codons corresponding to residues Ser 20 Thr 127 of mature protein D of 
Haemophilus Influenzae strain 772, biotype 2 (H. Janson et ai., 1991. Infection and 
Immunity, Jan. p. 1 19-125). The sequence of Prot-Di/3 is followed by a multiple 
5 cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His). 
This plasmid is used to express the fusion protein Dl/3-E7-his. 
b) - HPV genomic E6 and E7 sequences of prototype HPV18 (Cole et 
al.J.Mol.Biol.(l 987)193,599-608) were amplified from HPV 16 full length genome 
cloned in pBR322 (obtained from Deutsche Krebsforschungszentrum (DKFZ), 
10 Referenzzentrum fur human pathogen Papillomaviruses - D 691 20 - Heidelberg) and 
were subcloned into pUC19 to give TCA 302 (= pRIT14467). 
Construction of plasmid TCA 316(= pRIT14532) 

The nucleotides sequences corresponding to amino acids 1 105 of E7 
protein were amplified from pRlT 14467. During the polymerase chain reaction, Ncol 
15 and Spel restriction sites were generated at the 5* and 3* ends of the E7 sequences 

allowing insertion into the same sites of plasmid pMGMCS Prot Dl/3 to give plasmid 
TCA3 16 (= pRIT14532). The insert was sequenced and a modification versus 
E7/HPV18 prototype sequence was identified in E7 gene (nucleotide 128 G->A) 
generating a substitution of a glycine by a glutamic acid (aa 43 in E7 , position 156 in 
20 fusion protein). The protein and coding sequence for the fusion protein-Dl/3-E7-His 
/HPV 18 is set forth in sequence ID No. 15 and ID No. 16. 

l).b. Construction of plasmid TCA313 (=pRIT14523): a plasmid 
expressing thioredoxin 
Starting materials 

25 a) - Plasmid pBBRlMCS4 (Antoine R. and C.Locht,Mol.Microbiol. 1992,6,1785-1799 
; M.E.Kovach et al. Biotechniques 16, (5), 800-802 )which is compatible with 
plasmids containing ColEl or PI 5a origins of replication. 

b) - Plasmid pMG42 (described in WO93/041 75) containing the sequence of promoter 
pL of Lambda phage 

30 c) - Plasmid pTRX ( Inviu-ogen, kit Thiofusion K350-01) bearing the coding sequence 
for thioredoxin followed by Asp A transcription terminator. 
Construction of plasmid TCA313(-pRIT14523) 
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The fragment EcoRJ-Ndel fragment from pMG42, bearing pL promoter and 
the Ndel-Hindlll fragment from pTRX, bearing the coding sequence for thioredoxin 
followed by AspA terminator were purified and ligated into the EcoRI and Hindlll 
sites of piasmid vector pBBRlMCS4 to give plasmid TCA313(= pRIT14523). 
5 The coding sequence for thioredoxin is described in ID No. 1 7. 

2) - Transformation of AR58 strain 

2).a. To obtain strain BlOll expressing ProtDl/3-E7-His/HPV18 
Plasmid pRIT14532 was introduced into E. coH AR58 (Mott et al, 1985. Proc. Natl. 
Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor of the 
10 A, pL promoter , by selection for transformants resistant to kanamycine. 

2).b. Construction of strain B1012 expressing ProtDl/3-E7-His/HPV18 
and thioredoxin 

Plasmid pRITl4532 and pRITl4523 were introduced into E. coli AR58 (Mott 
et al., 1985, Proc. Natl. Acad, Sci., 82:88) a defective \ lysogen containing a 
1 5 thermosensitive repressor of the X, pL promoter ,by double selection for transformants 
resistant to kanamycin and ampicillin. 

3) - Growth and induction of bacterial strains BlOll and B1012 - Expression 

of Prot-Dl/3-E7-His/HPV18 without and with thioredoxin in trans 

Cells of AR58 transformed vsrith plasmids pRIT14532 ( BlOl 1 strain) and 
20 Cells of AR58 transformed with plasmids pRIT14532 and pRIT14523 (B1012 strain) 
were grown at BO'c in 100 ml of LB medium supplemented with 50 |igr/ml of 
Kanamycin for B 1011 strain and supplemented 50 |igr/ml of Kanamycin and 100 
^gr/ml of Ampicillin for B1012 strain . During the logarithmic phase of growth 
bacteria were shifted to 39*'C to inactivate the X repressor and turn on the synthesis 
25 of protein Dl/3-E7-his/HPV18 and thioredoxin. The incubation at 39°C was 
continued for 4 hours. 

Characterization effusion Protein Dl/3-E7-his /HPV18 
Preparation of extracts 

Frozen cells are thawed and resuspended in 10 ml of PBS buffer. Cells are 
30 broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract is centrifliged at 16.000 g for 30 minutes at 4°C. 
Analysis on Coomassie-stained SDS-polyacrylamide gels and Western blots 
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After centrifugaiion of extracts descnbed above, aliquots of supernatant and 
pellet were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting. 

The fusion protDl/3-E7-His (about 31 kDa) was visualised by Coomassie 
5 stained gels in the pellet fraction for strain BIO 1 1 and partially localized (30%) in the 
supernatant fraction for strain B 1012 and was identified in Western blots by rabbit 
polyclonal anti-protein-D and by Ni-NTA conjugate coupled to calf intestinal alkaline 
phosphatase (Qiagen cat. n*^ 34510) which detects accessible histidine tail. The level 
of expression represents about 1-3% of total protein as shown on a Coomassie-stained 
10 SDS-polyacrylamide gel. 

For the extract of strain B 101 2 the thioredoxin ( about 12 KDa) was visualised 
by coomassie stained gel in the supernatant and identified in western blots by 
monoclonal anti thioredoxin ( Invitrogen R920-25) 

EXAMPLE X: Construction of E.coli'strain B1098 expressing fusion ProtDl/3- 

15 E7 

Mutated (cys27->gly,glu29->gln ) type HP VI 8 
l)-Construction of expression plasmid 
Starting material: 

a) - Plasmid pRIT 14532 ( = TCA 316) which codes for fusion ProtDl/3-E7 -His 
20 b) - Plasmid LITMUS 28 ( New England Biolabs cat n° 306-28 ) , a cloning vector 
pUC-derived 

c) - Plasmid pMG MCS ProtDl/3 ( pRIT 14589) , a derivative of pMGSl (described 
supra) in which the codons 4-81 of NSl coding region from Influenza were replaced 
by the codons corresponding to residues Ser 20 -> Thr 127 of mature protein D of 

25 Haemophilus Influenzae strain 772, biotype 2 (H, Janson et a/., 1991 , Infection and 
Immunity, Jan. p.l 19-125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His) 
Construction of plasmid pRIT 14831( =^TCA355): a plasmid expressing the 
fusion Protein-Diy3-E7 mutated ( cys27->giy ,glu29->gln) with His tail 

30 The Ncol - Xbal fragment from pRIT 1 4532 (=TCA 316), bearing the coding 

sequence of E7 gene from HPV18 , elongated with an His tail , was subcloned in an 
intermediate vector Litmus 28 useful for mutagenesis to give pRIT 14910 (=TCA348) 



wo 99/33868 PCT/EP98/08563 

By analogy with E7/HPVi6 mutagenesis, double mutations cys27">gly and 
glu29->gln were chosen to impair the binding to the antioncogene product of 
Retinoblastome gene ( pRB ). 

The introduction of mutations in E7 gene was realized with the kit " Quick 
5 Change Site directed Mutagenesis ( Stratagene cat n° 200518) .As the sequencing of 
pRIT 14532 had pointed out the presence of a glutamic acid in position 43 of E7 
instead of a glycine in the prototype sequence of HPVl 8 , a second cycle of 
mutagenesis was realized to introduce a glycine in position 43 . We obtained plasmid 
pRIT 14829 ( = TCA353 ). After verification of presence of mutations and integrity 
10 of the complete E7 gene by sequencing , the mutated E7 gene was introduced into 
vector pRIT 14589 ( = pMG MCS ProtDl/3 ) to give plasmid pRlT 14831 
(=TCA355). 

The protein and coding sequence for the fusion protein-Dl/3-E 7 mutated 
(cys27->gly, glu29->gln ) -His is described in sequence ID No. 18 and 19. 
15 2)Constniction of strain B1098 expressing ProtDl/3-E7mutated (cys 27-->gly , 
glu29-->gln )-His /HPV18 

Plasmid pRIT 14831 was introduced into E.coli AR58 ( Mott et al. ,1985, 
Proc. Nali. Acad. Sci. , 82:88 ) a defective X lysogen containing a thermosensitive 
repressor of the X pL promoter ,to give strain B 1098 , by selection for transformants 
20 resistant to kanamycin. 

3) -Growth and induction of bacteria! strain B1098 - Expression of ProtDl/3-E7 
mutated (cys 27->gly , glu29->gln )-His /HPV18 

Cells of AR58 transformed v^dth plasmid pRIT 14831 ( B1098 strain ) were 
grown at 30°C in 100 ml of LB medium supplemented with 50 (igr /ml of Kanamycin. 
25 During the logarithmic phase of growth bacteria were shifted to 39°C to inactivate the 
X repressor and turn on the synthesis of ProtDl/3-E7 mutated -His /HPV18 . The 
incubation at 39*^0 was continued for 4 hours. Bacteria were pelleted and stored at - 
20''C. 

4) -Characterization of fusion ProtDl/3-E7 mut (cys24->gly, glu26->gln)- His type 
30 HPV16 




wo 99/33868 


PCT/EP98/08563 


Frozen cells were thawed and resuspended in 10 ml of PBS buffer. Ceils were 
broken in a French Pressure cell press SLM Ammco at 20 000 psi ( three passages) . 
The extract was centrifliged at 16000 g for 30 niinutes at 4°C. 
Analysis on Coomassie stained SDS-poiyacrylamidc gels and Western blots 

5 After centrifugation of extracts described above, aiiquots of supernatant and pellet 
were analysed by SDS-polyacryiamide gel electrophoresis and Western blotting. 
A major band of about 31 kDa, localized in the pellet fraction, was visualised by 
Coomassie stained gels and identified in Western blots by rabbit polyclonal 22 J 70 
anti-protein D and by monoclonal Penta-His (Qiagen cat. n° 34660) which detects 

10 accessible histidine tail. The level of expression represents about 3 to5 % of total 
protein. 

EXAMPLE XI: Construction of an E, coli strain expressing fusion Protem-Dl/3- 
E6-his / HPV18 

1. Construction of expression plasmid 

15 a) Plasmid pMG MCS protDl/3 (= pRITl4589) is a derivative of pMG81 (described 
supra) in which the codons 4-81 of NSl coding region from Influenza were replaced 
by the codons corresponding to residues Ser 20 -> Thr 127 of mature protein D of 
Haemophilus Influenzae strain 772, biotype 2 (H. Janson et a/., 1991, Infection and 
Immunity, Jan. p.l 19-125). The sequence of Prot-Dl/3 is followed by a multiple 

20 cloning site (1 1 residues) and a coding region for a C-terminal histidine tail (6 His). 
This plasmid is used to express the fusion protein Dl/3-E6-his. 

HPV genomic E6 and E7 sequences type HPV18 (Cole et aL, J. Mol. Biol. 1987, 193 , 
p.599-608. ) were ampUfied from HPV18 full length genome cloned in pBR322 
(obtained from Deutsches Krebsforschungszentnim (DKFZ), Referenzzentrum fllr 
25 human pathogen Papillomaviruses - D 69120 - Heidelberg) and were subcioned into 
pUC19 to give TCA 302 (= pRIT 14467). 

Construction of plasmid TCA 314(= pRIT14526) : a plasmid expressing the 
fusion Protein-Dl/3-E6-His /HPV18 


polymerase chain reaction. Ncol and Spel restriction sites were generated at the 5' and 
3' ends of the E6 sequences allowing insertion into the same sites of plasmid 


30 


The nucleotides sequences corresponding to amino acids 

I -> 158 of E6 protein were amplified from pRIT14467. During the 
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pMGMCS Prot Dl/3 to give plasmid TCA314 (= pRJT14526). The insert was 
sequenced to verify that no modification had been generated during the polymerase 
chain reaction. The protein and coding sequence for the fusion protein-Dl/3-E6-His 
is described in sequence ID No. 20 and 21. 
5 Transformation of AR58 strain 

Plasmid pRIT14526 was introduced into E. coli AR58 (Mott et at., 1985, Proc. 
Natl. Acad. Sci., 82:88) a defective X lysogen containing a thermosensitive repressor 
of the pL promoter. 

3. Growth and induction of bacterial strain - Expression of Prot-Dl/3-E6-His 

10 Cells of AR58 transformed with plasmid pRIT 14526 were grown in 100 ml of 

LB medium supplemented with 50 |igr/ml of Kanamycin at BO'^C. During the 
logarithmic phase of growth bacteria were shifted to 39^*0 to inactivate the X repressor 
and turn on the synthesis of protein Dl/3-E6-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20C. 

15 4. Characterization of fusion Protein Dl/3-E6-his 

Frozen cells are thawed and resuspended in 10 ml of PBS buffer. Cells are 
broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract is centrifuged at 16.000 g for 30 minutes at 4''C. After centrifugation of 
extracts described above, aliquots of supernatant and pellet were analysed by SDS- 

20 polyacrylamide gel electrophoresis and Western blotting. 

A major band of about 32 kDa, localized in the pellet fraction, was visualised by 
Coomassie stained gels and identified in Western blots by rabbit polyclonal anti- 
protein-D and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
(Qiagen cat. n" 34510) which detects accessible histidine tail. The level of expression 

25 represents about 3-5 % of total protein. 

EXAMPLE XII: Construction of an coli strain expressing fusion Protein- 

Dl/3-E6E7-his/ HPV18 

/. Construction of expression plasmid 

a) Plasmid pMG MCS prot Dl/3 (= pRlT14589) is a derivative of pMGSl (described 
30 supra) in which the codons 4-8 1 of NS 1 coding region from Influenza were replaced 
by the codons corresponding to residues Ser 20 -> Thr 127 of mature protein D of 
Haemophilus Influenzae strain 772, biotype 2 (H. Janson et a/., 1991, Infection and 


-26- 




wo 99/33868 


PCT/EP98/08563 


Immunity, Jan. p. 11 9- 125). The sequence of Prot-Dl/3 is followed by a multiple 
cloning site (U residues) and a coding region for a C-terminal histidine tail (6 His). 
This plasmid is used to express the fusion protein Dl/3-E6E7-his. 

b) HPV genomic E6 and E7 sequences type HPV18 (Cole et ai.J.Mol.Biol. 1987, 
5 193, 599-608) were amplified from HPV 1 8 full length genome cloned in pBR322 

(obtained from Deutsches Krebsforschungszentrum (DKFZ), Referenzzentrum fiir 
human pathogen Papillomaviruses - D 69120 - Heidelberg) and were subcloned into 
pUC19 to give TCA 302 (= pRITl4467). 

c) The coding sequences for E6 and E7 in TCA302 (= pRIT 

10 14467) were modified with a synthetic oligonucleotides adaptor (inserted between 
Hga I and Nsi I sites) introducing a deletion of 11 nucleotides between E6 and E7 
genes, removing the stop codon of E6 and creating fused E6 and E7 coding sequences 
in the plasmid TCA320( = pRIT 14618 ). 

Construction of plasmid TCA 328(= pRIT14567) : a plasmid expressing the 

15 fusion Protein-Dl/3-E6E7-His /HPV18 

The nucleotides sequences corresponding to amino acids 
1 -> 263 of fused E6E7 protein were amplified from pRIT146l8. During the 
polymerase chain reaction, Ncol and Spel restriction sites were generated at the 5' and 
3* ends of the E6E7 fused sequences alfowing insertion into the same sites of plasmid 

20 pMGMCS Prot Dl/3 to give plasmid TCA328 (= pRIT14567). The insert was 

sequenced to verify that no modification had been generated during the polymerase 
chain reaction. The protein and coding sequence for the fusion protein-Dl/3-E6E7- 
His is described in sequence ID No. 22 and 23. 

2. Transformation of AR58 strain 

25 Plasmid pRIT14567 was introduced into E. coli AR58 (Mott et al., 1985, Proc. 

Natl. Acad. Sci., 82:88) a defective X lysogen containing a diermosensitive repressor 
of the X pL promoter. 

3. Growth and induction of bacterial strain - Expression of Prot-Dl/3-E6E7-His 


30 LB medium supplemented with 50 ngr/ml of FCanamycin at 30°C. During the 

logarithmic phase of growth bacteria were shifted to 39°C to inactivate the k repressor 


Cells of AR58 transformed with plasmid pRITl45 12 were grown in 100 ml of 
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and mm on the synthesis of protein Dl/3-E6E7-his. The incubation at 39°C was 
continued for 4 hours. Bacteria were pelleted and stored at -20C. 
4.Characterizatioii of fusion Protein Dl/3-E6E7-his 

Frozen cells are thawed and resuspended in 10 ml of PBS buffer. Cells are 
5 broken in a French pressure cell press SLM Aminco at 20.000 psi (three passages). 
The extract is centrifuged at 1 6,000 g for 30 minutes at 4**C. 

After centrifiigation of extracts described above, aliquots of supernatant and 
pellet were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting. 

10 A major band of about 48 kDa, localized in the pellet fraction, was visualised by 
Coomassie stained gels and identified in Western blots by rabbit polyclonal anti- 
protein-D and by Ni-NTA conjugate coupled to calf intestinal alkaline phosphatase 
(Qiagen cat. n° 34510) which detects accessible histidine tail. The level of expression 
represents about I % of total protein. 

15 EXAMPLE XIII 

The therapeutic potential of vaccine containing the PDl/3 E7 fusion protein and 
different CpG oligonucleotides were evaluated in the TCI (E7 expressing tumour 
model.) 

1. Therapeutic experiments: protocol 

20 10e6 TCI cells, E7 expressing tumour cells: were injected subcutaneously 

(2001^1) in the flank of C57BL/6 immunocompetent mice. Mice were vaccinated 7 
and 14 days after the tumour challenge, with 5^g ProtD 1/3 E7 HPV16 injected intra- 
footpad (1 00^1 : 50^1 / footpad) in the presence of different adjuvants: 

2 and 4 weeks after the second immunisation, 5 mice/group were killed and 
25 spleens or popliteal lymph nodes were taken and analyzed for immune response. 
1.2 Results 

Groups of mice 

1) PBS 

2) ProtD 1/3 E7HPV 16 

30 3) ProtDl/3 E7 HPV16 + oligo 1: 1826 (WD 1001): tcc ATGACCrrccTG acgtt 

4) Oligo 1 

5) ProtD 1/3 E7 HPV16 + oligo 2/ 1758 (WD1002): tct ccc agc gtg cgc cat 
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6) OUgo 2 

Tumour Growth; 

was monitored by measuring individual tumours twice a week. 

Figure 1 : represents the mean tumour growth (in mm2)/group n=10 followed 
5 over 4 weeks. 

• The injection of 10e6 TCI cells injected subcutaneousiy give rise to a growing 
tumour in 100% of the animals. 

• Vaccinating with ProtDl/3E7 or adjuvant alone: 100% of the animals develop a 
tumour. 

10 • As shovm in figure 1 and 2, in the groups of mice that received die antigen with a 
CpG oligonucleotide the mean tumour growth remained very low and very similar 
between groups, reflecting that the tumour grovrth either was slowed down or that 
several tumours were completely rejected. 

The analysis of individual tumour growth 2 and 4 weeks after the latest 

1 5 vaccination showed that complete rejection in the groups were: 


Day 28 (n=l 0) day 42 (n=5) 

E7+oligol (1826) 40% 40% 

Oligol 0% ' 0% 

E7+oHgo2 (1758) 70% 40% 

01igo2 0% 0% 


The mean tumour growth/group of mice vaccinated with PDl/3 E7+ the CpG 
oligos are quite similar and analysis of the individual tumour growth showed that the 
20 CpG oligos induce prolonged complete tumour rejection. 
Conclusion 

Both CpG (Oligo 2>oligo 1) induced complete tumour regression . 
Lymphoproliferative response was analysed by in vitro restimulation of 
spleen and lymph nodes cells for 72 hrs with either PD1/3E7, the protein E7(Bollen) 
25 and PD (whole) PDl/3 (coated or not on latex nbeads) (10, I, O.l |ig/ml) 2 and 4 
weeks post 11. 

• Positive controls (ConA stimulaltion) were positive. 
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• Surprisingly, no E7 specific and no PD specific proliferative response could be 
observed starting with spleen cells 2 or 4 weeks post II (probably due to a technical 
problem: data not shown). 

• On the contrary, lymph node cells from mice that received ProtDl/3 E7 in CpG 
5 oligos i and 2 showed a very good E7 specific proliferative response although 

almost no PD (whole) specific response could be observed even at the hightest 
concentration of lOO^g/ml no PDl/3 specific responses was observed even when 
coated on latex |ibeads. 

Similar data were obtained 4 weeks post IL 
10 Serology 

The anti E7 antibody response: IgG tot and isotypes (IgGl, lgG2a, IgG2b, 
IgGTot) were measured by ELISA using the E7 protein as coating antigen as 
described in the Materials and Methods. Figures 3 and 4 show the relative percentage 
of the different IgG isotypes in the total of IgGs, 2 and 4 weeks post II respectively. 
15 -The Oligos affect only weakly (oligo 2) or not at all (Oligo 1) the weak antibody 
response observed when PD1/3E7 alone was injected. 
• The predominant E7 specific antibody subclass was clearly IgG2b for all the 
formulation tested (80-90% of the total IgGs), 

The same results were obtained 4Veeks post II 
20 Isotypic profile of anti E7 responses (post II, pooled sera) exp. 97293 


Groups 

IgGl 

IgG2a 

IgG2b 

IgGtot 

I) PBS 

0 

0 

0 

0 

2) ProtDl/3E7HPV16 

1020 

0 

4130 

4740 

3) ProtDl/3 E7 HPV16 + oligo 1 

170 

400 

3680 

4910 

4) Oligo 1 

0 

0 

530 

420 

5) ProtDl/3 E7 HPV16 + oligo 2 

0 

590 

7560 

13690 

6) Oligo 2 

0 

0 

0 

0 


Groups 

IgGl 

IgG2a 

IgG2b 

IgGtot 

1) PBS 

0 

0 

0 

0 

2) ProtDl/3 E7HPV I 

240 

0 

1650 

1400 

3) ProtDl/3 E7HPV1 6 + oligo 1 

0 

0 

1280 

1430 

4) Oligo 1 

0 

0 

0 

0 

5) ProtDl/3 E7HPV16 + oligo 2 

0 

560 

3600 

5880 

6) Oligo 2 

0 

0 

0 

0 
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CTL assay: 

A CTL response could be detected when measured 2 weeks after the latest 
vaccination, when ceils were re-stimulated in vitro with irradiated TCI when TCI or 
5 peptide E7 pulsed EL4, were used as target cells, when mice immunised with PDl/3 
E7 + CpG oligo 2> 1 (25-40% specific lysis) and not with oligos alone. 
• Lysis was seen on TC 1 cells than on peptide E7 pulsed EL4 cells, but this is mostly 
observed in the groups of mice vaccinated with PD1/3E7 + CpG oligos (2>I). In 
this experiment other formulations did not induce a CTL. 
10 • Using E7 pulsed EL4 cells, no lysis was observed when mice received the protein 
or the adjuvant alone. 


1.3 Materials and Methods 


Component 

Brand 

Batch number 

Concentration 
(mg/mi) 

Buffer 

ProtDl/3-E7 


957/015 

0.677 

PBS 7,4 

oligo CpG 
1826 

EuroGentec 

WDipOl 

5 

H2O 

oligo CpG 

EuroGentec 

WD 1002 

5 

H^O 


13.1 Formulation Process 

15 All the formulations were prepared on the day of injection. 

Oligo containing formulations 

Formulations containing oligo alone without other adjuvant were prepared by 
addition of CpG to the diluted PrtDl/3-E7 in PBS pH 7.4. 

The adjuvant controls without antigen were prepared by replacing the protein 
20 by PBS. 

i3J2 Mice and Cell lines 

Mice C57B1/6 (Iffa Credo) 6-8 weeks old mice were used in these 
experiments. 

Cell lines: TCI (obtained from the John Hopkin's University) , or EL4 cells 
25 were grown in RPMI 1640 (Bio Whittaker) containig 10% PCS and additives: 2mM 
L-Glutamine . 1% antibiotics (lOOOOU/ml penicilin, 10000^g/ml streptomycin) 1% 
non essential amino acid lOOx, 1% sodium pyruvate (Gibco), 5 lOe-5 M 2- 
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mercapioethanol. Before injection TCI cells were irypsynized and washed in serum 

free medium. 

U.3 Tumour growth: 

All the animals were injected with tumor cells on day O and were randomized 

5 at day 7. Individual tumor growth was followed over time ( the 2 main diameters ( A, 
B) were measured using calipers twice a week, A x B represents the "tumor surface" 
and the average of the 5 values / groups is showed on a graphic over time: 6 weeks 
13A CMI read out 

In vitro lymphoproliferation 

10 Lymphoproliferation was performed on individual spleens and on lymph node 

pools. 200000 spleen cells or popliteal lymph node cells were plated in triplicate, in 
96 well microplate, in RPMI medium containing 1% normal mice serum and 
additives . After 72 hrs of in vitro re-stimulation with different amounts of PDl/3 
E7 (1. O.l, 0.01 ng/ml) or E7 ( 10-1-0,1 ^g/ml ) After 72hrs, 100 ^l of culture 

15 supernatant were removed and replaced by fresh medium containing IjxCi 3H 
thymidine (Amersham 5Ci/mmol) . After 16 hrs, cells were harvested onto filter 
plates. Incorporated radioactivity was counted in a (J counter. Results are expressed in 
CPM (mean of triplicate wells) or as stimulation indexes (mean CPM in cultures with 
antigen / mean CPM in cultures without antigen). 

20 

1.3.5 CTL assay 

20 10e6 spleen cells were co-cultured with 2 10e6 irradiated (ISOOOr) TCI 
cells (E7 expressing tumor) for 7 days in the presenced or absence of ConA sup, 
(2%) 

25 Target cells used to assess cytotoxicity were either Cr5l ( DuPont NEN 

37MBq/ml) loaded (Ihr at 37°C) TCI cells or E7 pulsed EL4 cells (for I hr at 37''C 
during the Cr 51 loading of the cellsl0^g/ml of E7-derived peptide (49-57) (QCB) 
compared to EL4 cells NK dependant lysis was assessed on K562 target cells 2000 
target cells were added / well of 96 well plate ( V botttom nunc 2-45128) with 100/1 

30 being the highest Effector / target ratio. Controls for spontaneous or maximal Cr5 1 
release were perforated in sextuplet and were targets in medium or in triton 1.5%. All 
plates were gently centrifuged and incubated for 4 hrs at 37 in 7% C02. 50 pi of the 
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supernatant was deposed on 96w Lumaplate (Packard) let dry O/N and counted in a 
Top Count counter. Data is expressed as percent specific lysis which is calculated 
from the c.p.m. by the formula (experimental release - spontaneous release) / 
(maximal release - spontaneous release) X 100. 


Quantitation of anti E7 antibody was performed by Elisa using E7as coating 
antigen. Antigen and antibody solutions were used at 50 \i\ per well. Antigen was 
diluted at a final concentration of 3 (ig/ml in carbonate buffer ph9.5 and was adsorbed 
overnight at 4*^0 to the wells of 96 wells microliter plates (Maxisorb Immuno-plate, 

10 Nunc, Denmark). The plates were then incubated for Ihr at 37°c with PBS containing 
1% bovine serum albumin and 0.1% Tween 20 (saturation buffer). Two-fold dilutions 
of sera (starting at 1/100 dilution) in the saturation buffer were added to the E7-coated 
plates and incubated for 1 hr 30 min at 37°c. The plates were washed 3 times with 
PBS 0.1% Tween 20 and biotin-conjugated anti-mouse IgGl, IgG2a or IgG2b or 

15 IgGtot (Amersham, UK) diluted 1/5000 in saturation buffer was added to each well 
and incubated for I hr 30 min at 37°c. After a washing step, streptavidin-biotinylated 
peroxydase complex (Amersham, UK) diluted 1/5000 in saturation buffer was added 
for an additional 30 min at 37°c. Plates were washed as above and incubated for 10 
min with TMB( tetra-methyl-benzidine). The reaction was stopped with H2S04 4N 

20 and read at 450 nm. Midpoint dilutions were calculated by SoftmaxPro (using a four 
parameters equation ). 
EXAMPLE XIV 

In a second experiment, the vaccine of the invention were tested to assess the 
significance of the backbone: 
25 Therapeutic experiment: protocol 

• 10e6 TCI cells , E7 expressing tumor cells : were injected subcutaneously 
(200nl) in the flank of immunocompetent C57BL/6 mice. 

30 • 2 vaccinations, 7 and 14 days after the tumor challenge, with 5|ag ProtD 1/3 E7 
HPV16 injected intra- footpad (100 nl : 50|il / footpad) +/- CpG oligo; Oligo 
1 (WD 1001) as a phosphorothioate modified or the same Oligo (WD 1006) but with 
phosphodiester linkage. 


5 


Serology 
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• 5 animals /group. 

The tumor growth was momtored by measuring individual tumors twice a week and 
5 the mean tumor growth/' group of 5 animals is depicted in figure 5 and show the 
phosphorothioate modified oligonucleotides are effective in bringing about tumour 
regression. 

Conclusions: 

iO 

• All the animals that received 1 0e6 TC 1 tumor cells develop a growing tumor. 

• 1 00% of the animals vaccinated twice, 7 days apart, with the PD 1/3 E7 HP V 1 6 
protein alone develop a tumor. 

15 • 100% of the animals receiving the PDl/3 E7 protein + oligo WD 1006 develop a 
tumor at the concentrations tested 

• All the groups of animals that received the E7 protein + CpG 1001 at a 
concentration ranging from 10 to 200ng show tumor regression partial or 

20 complete(20-40%). 

The first concentration at which this therapeutic effect on tumor regression is not fully 
obtained is E7+ l^ig CpG oligo 1001. * 

25 EXAMPLE XV 

In a third series of experiments, the vaccines of the invention were evaluated in 
transgenic mice expressing E7 protein. 

30 • The transgenic mouse strain has been generated by M. Parmentier and C. Ledent 
at the IRIBHN (ULB). (Ref: PNAS (USA) 1990, 87; 6176-6180). 

• As transgenic mice live with the E7 HP V 1 6 gene from birth, they are considered 
"tolerant" to this gene: E7 from HPV 16, in this situation is considered as a "self 
35 antigen". 
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• The expression of the transgene is driven by the thyroglobulin promoter. As 
Thyrogiobulin is constitutively expressed only In the Thyroid, E7 is expressed in 
the thyroid. 

5 • As a consequence of this expression, thyroid cells proliferate, mouse develop 

goiter and nodules which after 6 months to 1 year can evoiuate in invasive cancer. 

The results (figure 6) of the experiments show that therapeutic vaccination with CpG 
oligonucleotide and antigen as described herein, results in a reduction of tumour 
10 growth and can induce complete tumour regression. 

Material & Methods 

• 10e6 TCI cells, E7 expressing tumor cells : were injected subcutaneously 
15 (200nl) in the flank of male or female C57BL/6 Transgenic 

• mice were vaccinated 7 and 14 days after the tumor challenge, with 5^g ProtD 
1/3 E7 HPV16 injected intra- footpad (100 \i\ : 50^1 / footpad) in the 2 
presence of CpG oligonucleotide TCT CCC AGO GTG CGC CAT and two control 

20 adjuvants:, 

• 1 0 animals /group 

2 and 4 weeks after the second immunization were killed and spleens or popliteal 
25 lymph. 

Conclusion 

The vaccines of the invention are effective in bringing about tumour regression in 
30 HPV induced tumours. 
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CLAIMS 


1 . A composition comprising an E6 or E7 protein or E6/E7 fusion protein from HPV 
optionally linked to an immunological fusion partner, and an immunomodulatory 


2. A composition as claimed in claim 1 wherein the fusion partner is selected from 
the group; protein D or a fragment thereof from Heamophilius influenzae B, 
lipoprotein D or fragment thereof from Heamophilius influenzae B, NSl or 
fragment thereof from Influenzae Virus, and LYTA or fragment thereof from 


3. A composition as claimed in claim 1 or 2 wherein the E6 or E7 proteins are 
derived from HPV 16 or HPV 18. 

4. A composition as claimed in claim K 2 or 3 wherein the E7 protein is mutated. 

5. A composition as claimed in claim 1 , 2 or 3 wherein the E6 protein is mutated. 
15 6. A composition as claimed in any of claims 1 to 5 additionally comprising a 


histidine tag of at least 4 histidine residues. 

7. A composition as claimed herein comprising an additional HPV antigen. 

8. A composition as claimed herein where the immumodulatory CpG oligonucleotide 
comprises a hexamer motif: purine purine cytosine guaine pyrimidine pyrimidine. 

20 9. A composition as claimed herein wherein the inamunomodulatory CpG 
oligonucleotide has two or more CpG motifs. 

10. A composition as claimed herein wherein the CpG oligonucleotide contains a 
phosphorothioate inter-nucleotide linkage. 

1 1 . A composition as claimed herein wherein the CpG oligonucleotide is selected 
25 from the group: 


5 


CpG oligonucleotide. 


10 


Streptococcus Pneumoniae. 


OLIGO I : TCC ATG ACG TTC CTG ACG TT 
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OUGO 2: TCT CCC AGC GTG CGC CAT 

OLIGO 3: ACC GAT GAG GTC GCC GGT GAG GGC AGO ACG 


12. A composition as claimed herein for use in medicine. 

5 1 3. A method of inducing an immune response in a patient to an HPV antigen 

comprising administering a safe and effective amount of a composition as claimed 
herein. 

14. A method of preventing or treating HPV induced tumours in a patient comprising 
administering a safe and effective amount of a composition as claimed herein. 

10 1 5. A method of preparing a composition as claimed herein, comprising admixing an 
E6, E7 or E6/E7 fusion protein optionally linked to an immunological fusion 
parmer, and an immunomodulatory CpG oligonucleotide. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

5 

(i) APPLICANT: BRUCK, CLAUDINE 
(ii) TITLE OF THE INVENTION: VACCINE 


(iii) NUMBER OF SEQUENCES: 23 

(IV) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham 
15 (B) STREET; 2 New tiorizons Court, Great West Road, B 

(C) CITY: Middx 

(D) STATE: 

(E) COUNTRY: UK 

(F) ZIP: TW8 9EP 

20 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE; Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ for Windows Version 2.0 


(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

30 {C} CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

35 


(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Dalton. Marcus J 

40 (B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER; B4 5124 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 0181 9756348 

45 (B) TELEFAX: 0181 9756177 

(C) TELEX: 


50 


(2) INFORMATION FOR SEQ ID N0:1: 


U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

Protein D 1/3 E7 his 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

60 Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
15 10 15 

Ser Asp Lys lie He He Ala His Arq Gly Ala Ser Gly Tyr Leu Pro 

20 25 30 

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
65 35 40 45 

Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arc Leu Val Val 
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10 


20 


60 



50 





55 





60 





He 

His 

Asp 

His 

Phe 

Leu 

Asp 

Gly 

Leu 

Thr 

ASD 

Val 

Ala 

Lys 

Lys 

Phe 

65 




70 





75' 





80 

Pro 

His 

Arg 

His 

Arg 

Lys 

Asp 

Gly Arg 

Tyr 

Tyr 

Val 

He 

Asp 

Phe 

Thr 




85 




90 





95 


Leu 

Lys 

Glu 

He 

Gin 

Ser 

Leu 

Glu 

Met 

Thr 

Glu 

Asn 

Phe 

Giu 

Thr 

Met 



ICQ 





105 





110 



Ala 

Met 

His 

Gly Asp 

Thr 

Pro 

Thr 

Leu 

His 

Glu 

Tyr 

Met 

Leu 

Asp 

Leu 



115 





120 





125 




Gin 

Pro 
130 

Glu 

Thr 

Thr 

Asp 

Leu 
135 

Tyr 

Cys 

Tyr 

Glu 

Gin 
140 

Leu 

Asn 

Asp 

Ser 

Ser 

Glu 

Glu 

Glu 

Asp 

Glu 

He 

Asp 

Gly 

Pro 

Ala 

Gly 

Gin 

Ala 

Glu 

Pro 

145 




150 





155 





160 

Asp 

Arg 

Ala 

His 

Tyr 

Asn 

lie 

Val 

Thr 

Phe 

Cys 

Cys 

Lys 

Cys 

Asp 

Ser 



165 





170 





175 


Thr 

Leu Arg 

Leu 

Cys 

Val 

Gin 

Ser 

Thr 

His 

Val 

Asp 

He 

Arg 

Thr 

Leu 




180 





185 





190 



Glu 

Asp 

Leu 

Leu 

Met 

Gly Thr 

Leu 

Gly 

lie 

Val 

Cys 

Pro 

He 

Cys 

Ser 


195 





200 





205 




Gin 

Lys 
210 

Pro 

Thr 

Ser 

Gly 

His 
215 

His 

His 

His 

His 

His 
220 






(2) INFOEIMATION FOR SEQ ID NO: 2: 


25 Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 Protein D 1/3 E7 his 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
35 60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

40 CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGCATGGAGA TACACCTACA 
45 350 

TTGCATGAAT ATATGTTAGA TTTGCAACCA GAGACAACTG ATCTCTACTG TTATGAGCAA 
420 

TTAAATGACA GCTCAGAGGA GGAGGATGAA ATAGATGGTC CAGCTGGACA AGCAGAACCG 
480 

50 GACAGAGCCC ATTACAATAT TGTAACCTTT TGTTGCAAGT GTGACTCTAC GCTTCGGTTG 
540 

TGCGTACAAA GCACACACGT AGACATTCGT ACTTTGGAAG ACCTGTTAAT GGGCACACTA 
600 

GGAATTGTGT GCCCCATCTG TTCTCAGAAA CCAACTAGTG GCCACCATCA CCATCACCAT 
55 660 


TAA 

663 


(2) INFORMATION FOR SEQ ID NO: 3: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
65 (D) TOPOLOGY: linear 

Protein D 1/3 E6 His/HPV 16 
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3 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
5 60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

10 CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGTTTCAGGA CCCACAGGAG 
15 360 

CGACCCAGAA AGTTACCACA GTTATGCACA GAGCTGCAAA CAACTATACA TGATATAATA 
420 

TTAGAATGTG TGTACTGCAA GCAACAGTTA CTGCGACGTG AGGTATATGA CTTTGCTTTT 
480 

20 CGGGATTTAT GCATAGTATA TAGAGATGGG AATCCATATG CTGTATGTGA TAAATGTTTA 
540 

AAGTTTTATT CTAAAATTAG TGAGTATAGA CATTATTGTT ATAGTTTGTA TGGAACAACA 
600 

TTAGAACAGC AATACAACA.A ACCGTTGTGT GATTTGTTAA TTAGGTGTAT TAACTGTCAA 
25 660 

AAGCCACTGT GTCCTGAAGA AAAGCAAAGA CATCTGGACA AAAAGCAAAG ATTCCATAAT 
720 

ATAAGGGGTC GGTGGACCGG TCGATGTATG TCTTGTTGCA GATCATCAAG AACACGTAGA 
790 

30 GAAACCCAGC TGACTAGTGG CCACCATCAC CATCACCATT AA 
822 


(2) INFOEy^TION FOR SEO ID NO: 4: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 Protein D 1/3 E6 His/HPV 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
45 1 5 10 15 

Ser Asp Lys He He He Ala His Arg Gly Ala Ser Gly Tyr Leu Pro 

20 25 30 

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
35 40 45 

50 Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val 
50 55 60 

He His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 70 75 80 

Pro His Arg His Arg Lys Aso Gly Arg Tyr Tyr Val He Asp Phe Thr 
55 85 " 90 95 

Leu Lys Glu lie Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

100 105 110 

Ala Met Phe Gin Aso Pro Gin Glu Arg Pro Arg Lys Leu Pro Gin Leu 
115 ' 120 125 

60 Cys Thr Glu Leu Gin Thr Thr He His Asp He He Leu Glu Cys Val 
130 135 140 

Tyr Cys Lys Gin Gin Leu Leu Arg Arg Glu Val Tyr Asp Phe Ala Phe 
145 150 155 160 

Arq Asp Leu Cys He Val Tyr Arg Asp Gly Asn Pro Tyr Ala Val Cys 
65 165 170 175 

Asp Lys Cys Leu Lys Phe Tyr Ser Lys He Ser Giu Tyr Arg His Tyr 
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25 





180 




135 





190 



Cys 

Tyr 

Ser 

Leu 

Tyr 

Giy 

Thr Thr 

Leu 

Glu 

Gin 

Gin 

Tyr 

Asn 

Lys 

Pro 

195 




200 





205 




Leu 

Cys 

Asp 

Leu 

Leu 

lie 

Arg Cys 

lie 

Asn 

Cys 

Gin 

Lys 

Pro 

Leu 

Cys 


210 





215 




220 





Pro 

Glu 

Glu 

Lys 

Gin 

Arg 

His Leu 

Asp 

Lys 

Lys 

Gin 

Arg 

Phe 

His 

Asn 

225 




230 




235 





240 

lie 

Arg 

Gly 

Arg 

Trp 

Thr 

Gly Arg 

Cys 

Met 

Ser 

Cys 

Cys 

Arg 

Ser 

Ser 


245 




250 





255 


Arg 

Thr 

Arg 

Arg 

Glu 

Thr 

Gin Leu 

Thr 

Ser 

Giy 

His 

His 

His 

His 

His 


260 




265 





270 



His 
















10 


15 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Protein D 1/3 E6/E7/ HPV16 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 


ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

30 CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
35 300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGTTTCAGGA CCCACAGGAG 
360 

CGACCCAGAA AGTTACCACA GTTATGCACA GAGCTGCAAA CAACTATACA TGATATAATA 
420 

40 TTAGAATGTG TGTACTGCAA GCAACAGTTA CTGCGACGTG AGGTATATGA CTTTGCTTTT 
480 

CGGGATTTAT GCATAGTATA TAGAGATGGG AATCCATATG CTGTATGTGA TAAATGTTTA 
540 

AAGTTTTATT CTAAAATTAG TGAGTATAGA CATTATTGTT ATAGTTTGTA TGGAACAACA 
45 600 

TTAGAACAGC AATACAACAA ACCGTTGTGT GATTTGTTAA TTAGGTGTAT TAACTGTCAA 
660 

AAGCCACTGT GTCCTGAAGA AAAGCAAAGA CATCTGGACA AAAAGCAAAG ATTCCATAAT 
720 

50 ATAAGGGGTC GGTGGACCGG TCGATGTATG TCTTGTTGCA GATCATCAAG AACACGTAGA 
780 

GAAACCCAGC TGATGCATGG AGATACACCT ACATTGCATG AATATATGTT AGATTTGCAA 
840 

CCAGAGACAA CTGATCTCTA CTGTTATGAG CAATTAAATG ACAGCTCAGA GGAGGAGGAT 
55 900 

GAAATAGATG GTCCAGCTGG ACAAGCAGAA CCGGACAGAG CCCATTACAA TATTGTAACC 
960 

TTTTGTTGCA AGTGTGACTC TACGCTTCGG TTGTGCGTAC AAAGCACACA CGTAGACATT 
1020 

60 CGTACTTTGG AAGACCTGTT AATGGGCACA CTAGGAATTG TGTGCCCCAT CTGTTCTCAG 
1080 

AAACCAACTA GTGGCCACCA TCACCATCAC CATTAA 
1116 

65 (2) INFORMATION FOR SEQ ID NO : 6 : 
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5 

ti) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

Protein D 1/3 E6/E7/ HPV16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

10 Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
15 10 15 

Ser Asp Lys He He lie Ala His Arg Gly Ala Ser Giy Tyr Leu Pro 

20 25 30 

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
15 35 40 45 

Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Vai 

50 55 60 

lie His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 10 75 80 

20 Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Vai lie Asp Phe Thr 
85 90 95 

Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

100 105 110 

Ala Met Phe Gin Aso Pro Gin Glu Arg Pro Arg Lys Leu Pro Gin Leu 
25 115 ' 120 125 

Cys Thr Glu Leu Gin Thr Thr He His Asp He lie Leu Glu Cys Val 

130 135 140 

Tvr Cys Lys Gin Gin Leu Leu Arg Arg Glu Vai Tyr Asp Phe Ala Phe 
145 150 155 160 

30 Arg Asp Leu Cys lie Vai Tyr Arg Asp Gly Asn Pro Tyr Ala Val Cys 
165 1*^0 1*^5 

Asp Lys Cys Leu Lys Phe Tyr Ser Lys lie Ser Glu Tyr Arg His Tyr 

180 185 190 

CVS Tyr Ser Leu Tyr Giy Thr Thr Leu Glu Gin Gin Tyr Asn Lys Pro 
35 195 200 205 

Leu Cys Asp Leu Leu He Arg Cys He Asn Cys Gin Lys Pro Leu Cys 

210 215 220 

Pro Glu Glu Lys Gin Arg His Leu Asp Lys Lys Gin Arg Phe His Asn 
225 230 235 240 

40 He Arg Gly Arg Trp Thr Gly Arg Cys Met Ser Cys Cys Arg Ser Ser 
245 250 255 

Arg Thr Arg Arg Glu Thr Gin Leu Met His Gly Asp Thr Pro Thr Leu 

260 265 270 

His Glu Tyr Met Leu Asp Leu Gin Pro Glu Thr Thr Asp Leu Tyr Cys 
45 275 280 285 

Tyr Glu Gin Leu Asn Asp Ser Ser Glu Glu Glu Asp Giu He Asp Gly 

290 295 300 

Pro Ala Gly Gin Ala Glu Pro Asp Arg Ala His Tyr Asn He Val Thr 
305 310 315 320 

50 Phe Cys Cys Lys Cys Asp Ser Thr Leu Arg Leu Cys Val Gin Ser Thr 
325 330 335 

His Val Asp He Arg Thr Leu Giu Asp Leu Leu Met Gly Thr Leu Gly 

340 345 350 

He Vai Cys Pro He Cys Ser Gin Lys Pro Thr Ser Gly His His His 
55 355 360 365 

His His His 
370 


60 


(2) INFORMATION FOR SEQ ID NO: 7: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 
tC} STRANDEDNESS: single 

65 (D) TOPOLOGY: linear 

Protein D 1/3 E7 mutated KPV 16 
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6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
5 60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

10 CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGCATGGAGA TACACCTACA 
15 360 

TTGCATGAAT ATATGTTAGA TTTGCAACCA GAGACAACTG ATCTCTACGG TTATCAGCAA 
420 

TTAAATGACA GCTCAGAGGA GGAGGATGAA ATAGATGGTC CAGCTGGACA AGCAGAACCG 
480 

20 GACAGAGCCC ATTACAATAT TGTAACCTTT TGTTGCAAGT GTGACTCTAC GCTTCGGTTG 
540 

TGCGTACAAA GCACACACGT AGACATTCGT ACTTTGGAAG ACCTGTTAAT GGGCACACTA 
600 

GGAATTGTGT GCCCCATCTG TTCTCAGAAA CCAACTAGTG GCCACCATCA CCATCACCAT 
25 660 


30 


TAA 
663 


(2) INFORMATION FOR SEQ ID NO : 8 : 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

Protein D 1/3 E7 mutated HPV 16 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

40 Met AsD Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
1 " 5 10 15 

Ser AsD Lvs He He lie Ala His Arg Gly Ala Ser Gly Tyr Leu Pro 

" ' 20 25 30 

Glu His Thr Leu Giu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
45 35 40 45 

Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val 

50 55 60 

He His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 70 75 ao 

50 Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val He Asp Phe Thr 
85 90 95 

Leu Lys Glu lie Gin Ser Leu Glu Met Thr Giu Asn Phe Giu Thr Met 

100 105 110 

Ala Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu 
55 115 120 125 

Gin Pro Glu Thr Thr Asp Leu Tyr Gly Tyr Gin Gin Leu Asn Asp Ser 

130 135 140 

Ser Giu Glu Glu Asp Glu He Asp Gly Pro Ala Gly Gin Ala Glu Pro 
145 150 155 160 

60 Asp Arq Ala His Tyr Asn He Val Thr Phe Cys Cys Lys Cys Asp Ser 
165 l-^O 175 

Thr Leu Arg Leu Cys Val Gin Ser Thr His Val Asp He Arg Thr Leu 

180 185 190 

Glu ASD Leu Leu Met Gly Thr Leu Gly He Val Cys Pro He Cys Ser 
65 " 195 200 205 

Gin Lys Pro Thr Ser Gly His His His His Hxs HiS 
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210 215 220 


(2) INFORMATION FOR SEQ ID NO: 9: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
10 CLYTA E6 His HPV 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 


ATGAAAGGGG GAATTGTACA TTCAGACGGC TCTTATCCAA AAGACAAGTT TGAGAAAATC 
15 60 

AATGGCACTT GGTACTACTT TGACAGTTCA GGCTATATGC TTGCAGACCG CTGGAGGAAG 
120 

CACACAGACG GCAACTGGTA CTGGTTCGAC AACTCAGGCG AAATGGCTAC AGGCTGGAAG 
180 

20 AAAATCGCTG ATAAGTGGTA CTATTTCAAC GAAGAAGGTG CCATGAAGAC AGGCTGGGTC 
240 

AAGTACAAGG ACACTTGGTA CTACTTAGAC GCTAAAGAAG GCGCCATGGT ATCAAATGCC 
300 

TTTATCCAGT CAGCGGACGG AACAGGCTGG TACTACCTCA AACCAGACGG AACACTGGCA 
25 360 

GACAGGCCAG AATTGGCCAG CATGCTGGAC ATGGCCATGT TTCAGGACCC ACAGGAGCGA 
420 

CCCAGAAAGT TACCACAGTT ATGCACAGAG CTGCAAACAA CTATACATGA TATAATATTA 
480 

30 GAATGTGTGT ACTGCAAGCA ACAGTTACTG CGACGTGAGG TATATGACTT TGCTTTTCGG 
540 

GATTTATGCA TAGTATATAG AGATGGGAAT CCATATGCTG TATGTGATAA ATGTTTAAAG 
600 

TTTTATTCTA AAATTAGTGA GTATAGACAT TATTGTTATA GTTTGTATGG AACAACATTA 
35 660 

GAACAGCAAT ACAACAAACC GTTGTGTGAT TTGTTAATTA GGTGTATTAA CTGTCAAAAG 
720 

CCACTGTGTC CTGAAGAAAA GCAAAGACAT CTGGACAAAA AGCAAAGATT CCATAATATA 
780 

40 AGGGGTCGGT GGACCGGTCG ATGTATGTCT TGTTGCAGAT CATCAAGAAC ACGTAGAGAA 
840 

ACCCAGCTGA CTAGTGGCCA CCATCACCAT CACCATTAA 
879 

45 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE; amino acid 

50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
CLYTA E6 His HPV 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

55 

Met Lys Giy Gly He Val His Ser Asp Gly Ser Tyr Pro Lys Asp Lys 

1 5 10 15 

Phe Glu Lys He Asn Gly Thr Trp Tyr Tyr Phe Asp Ser Ser Giy Tyr 
20 25 30 

60 Met Leu Ala Asd Arg Trp Arg Lys His Thr Asp Gly Asn Trp Tyr Trp 
35 ' 40 45 

Phe Asp Asn Ser Gly Glu Met Ala Thr Gly Trp Lys Lys He Ala Asp 

50 55 60 

Lys Trp Tyr Tyr Phe Asn Glu Glu Gly Ala Met Lys Thr Gly Trp Val 
65 65 70 75 80 

Lys Tyr Lys Asp Thr Trp Tyr Tyr Leu Asp Ala Lys Glu Gly Ala Met 
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Val 

Ser 

Asn 

Ala 
100 

8 5 
Phe 

He 

Gin 


Leu 

Lys 

Pro 

Asp 

Gly 

Thr 

Leu 

5 



115 






Leu 

Asp Met 

Aia 

Met 

Phe 

Gin 



130 





135 


Pro 

Gin 

Leu 

Cys 

Thr 

Glu 

Leu 


L4 5 




150 


10 

Giu 

Cys 

Val 

Tyr 

Cys 
165 

Lys 

Gin 


Phe 

Ala 

Phe 

Arg 
IBO 

Asp 

Leu 

Cys 


Ala 

Val 

Cys 

Asp 

Lys 

Cys 

Leu 

15 



195 






Arg 

His 
210 

Tyr 

Cys 

Tyr 

Ser 

Leu 
215 


Asn 

Lys 

Pro 

Leu 

Cys 

Asp 

Leu 


225 





230 


20 

Pro 

Leu Cys 

Pro 

Glu 

Glu 

Lys 






245 




Phe 

His 

Asn 

He 

Arg 

Gly Arg 





260 





Arg 

Ser 

Ser 

Arg 

Thr 

Arg 

Arg 

25 

His 

His 

275 
His 

His 





290 
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90 95 


Ser 

Aia 


Gly 

Thr 



Tyr 

Tyr 


105 





110 



Ala 

Asp 


Pro 

Glu 

Leu 

Ala 

Ser 

Met 

120 





125 




Asp 

Pro 

Gin 

Glu 

Arg 

Pro 

Arq 

Lys 

Leu 





140 





Gin 

Thr 

Thr 

lie 

His 

sp 

He 

He 





155 





150 

Gin 

eu 

Leu 

Arg 

Arg 

Glu 

Val 

1 yr 

Asp 



170 





175 


lie 

Val 

Tyr 

^9 

sp 

Gly 

Asn 

Pro 

1 yr 


18 5 





190 



Lys 

Phe 

Tyr 

Ser 

ys 

He 

Ss r 

LjXU 

Tyr 

200 





205 




Tyr 

Gly 

Thr 

Thr 

Leu 

Glu 

Gin 

Gin 

Tyr 



220 




Leu 

He 

Arg 

Cys 

He 

Asn 

Cys 

Gin 

Lys 




235 





240 

Gin 

Arg 

His 

Leu 

Asp 

Lys 

Lys 

Gin 

Arg 



250 





255 


Trp 

Thr 

Gly 

Arg 

Cys 

Met 

Ser 

Cys 

Cys 


265 





270 



Glu 

Thr 

Gin 

Leu 

Thr 

Ser 

Gly 

His 

His 


280 285 


(2) INFORMATION FOR SEQ ID NO: 11: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

CLYTA E7 HIS HPV 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

40 ATGAAAGGGG GAATTGTACA TTCAGACGGC TCTTATCCAA AAGACAAGTT TGAGAAAATC 
60 

AATGGCACTT GGTACTACTT TGACAGTTCA GGCTATATGC TTGCAGACCG CTGGAGGAAG 
120 

CACACAGACG GCAACTGGTA CTGGTTCGAC AACTCAGGCG AAATGGCTAC AGGCTGGAAG 
45 130 

AAAATCGCTG ATAAGTGGTA CTATTTCAAC GAAGAAGGTG CCATGAAGAC AGGCTGGGTC 
240 

AAGTACAAGG ACACTTGGTA CTACTTAGAC GCTAAAGAAG GCGCCATGGT ATCAAATGCC 
300 

50 TTTATCCAGT CAGCGGACGG AACAGGCTGG TACTACCTCA AACCAGACGG AACACTGGCA 
360 

GACAGGCCAG AATTGGCCAG CATGCTGGAC ATGGCCATGC ATGGAGATAC ACCTACATTG 
420 

CATGAATATA TGTTAGATTT GCAACCAGAG ACAACTGATC TCTACTGTTA TGAGCAATTA 
55 480 

AATGACAGCT CAGAGGAGGA GGATGAAATA GATGGTCCAG CTGGACAAGC AGAACCGGAC 
540 

AGAGCCCATT ACAATATTGT AACCTTTTGT TGCAAGTGTG ACTCTACGCT TCGGTTGTGC 
600 

60 GTACAAAGCA CACACGTAGA CATTCGTACT TTGGAAGACC TGTTAATGGG CACACTAGGA 
660 

ATTGTGTGCC CCATCTGTTC TCAGAAACCA ACTAGTGGCC ACCATCACCA TCACCATTAA 
720 


65 


(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

CLYTA E7 HIS HPV 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

10 Met Lys Gly Gly He Val His Ser Asp Giy Ser Tyr Pro Lys Asp Lys 
15 10 15 

Phe Glu Lys He Asn Gly Thr Trp Tyr Tyr Phe Asp Ser Ser Gly Tyr 

20 25 30 

Met Leu Ala Asp Arg Trp Arg Lys His Thr Asp Gly Asn Trp Tyr Trp 
15 35 40 45 

Phe Asp Asn Ser Gly Glu Met Ala Thr Gly Trp Lys Lys He Ala Asp 

50 55 60 

Lys Trp Tyr Tyr Phe Asn Glu Glu Gly Ala Met Lys Thr Gly Trp Val 
65 70 "75 80 

20 Lys Tyr Lys Asp Thr Trp Tyr Tyr Leu Asp Ala Lys Glu Gly Ala Met 
85 90 95 

Val Ser Asn Ala Phe He Gin Ser Ala Asp Gly Thr Gly Trp Tyr Tyr 

100 105 110 

Leu Lys Pro Asp Gly Thr Leu Ala Asp Arg Pro Glu Leu Ala Ser Met 
25 115 120 125 

Leu Asp Met Ala Met His Giy Asp Thr Pro Thr Leu His Glu Tyr Met 

130 135 140 

Leu Asp Leu Gin Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gin Leu 
145 150 155 160 

30 Asn Asp Ser Ser Glu Glu Glu Asp Glu He Asp Gly Pro Ala Gly Gin 
165 ' 170 175 

Ala Glu Pro Asp Arg Ala His Tyr Asn He Val Thr Phe Cys Cys Lys 

180 185 190 

Cys Asp Ser Thr Leu Arg Leu Cys Val Gin Ser Thr His Val Asp He 
35 195 200 205 

Arg Thr Leu Glu Asp Leu Leu Met Gly Thr Leu Gly He Val Cys Pro 

210 215 220 

He Cys Ser Gin Lys Pro Thr Ser Gly His His His His His His 
225 230 235 


40 


(2) INFORMATION FOR SEQ ID NO: 13: 


(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1173 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
CLYTA E6E7 His HPV16 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATGAAAGGGG GAATTGTACA TTCAGACGGC TCTTATCCAA AAGACAAGTT TGAGAAAATC 
60 

AATGGCACTT GGTACTACTT TGACAGTTCA GGCTATATGC TTGCAGACCG CTGGAGGAAG 
55 120 

CACACAGACG GCAACTGGTA CTGGTTCGAC AACTCAGGCG AAATGGCTAC AGGCTGGAAG 
130 

AAAATCGCTG ATAAGTGGTA CTATTTCAAC GAAGAAGGTG CCATGAAGAC AGGCTGGGTC 
240 

60 .lU^GTACAAGG ACACTTGGTA CTACTTAGAC GCTAAAGAAG GCGCCATGGT ATCAAATGCC 
300 

TTTATCCAGT CAGCGGACGG AACAGGC7GG TACTACCTCA AACCAGACGC AACACTGGCA 
360 

GACAGGCCAG AATTGGCCAG CATGCTGGAC ATGGCCATGT TTCAGGACCC ACAGGAGCGA 
65 420 
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CCCAGAAAGT TACCACAGTT ATGCACAGAG CTGCAAACAA CTATACATGA TATAATATTA 
460 

GAATGTGTGT ACTGCAAGCA ACAGTTACTG CGACGTGAGG TATATGACTT TGCTTTTCGG 
540 

5 GATTTATGCA TAGTATATAG AGATGGGAAT CCATATGCTG TATGTGATAA ATGTTTAAAG 
600 

TTTTATTCTA AAATTAGTGA GTATAGACAT TATTGTTATA GTTTGTATGG AACAACATTA 
660 

GAACAGCAAT ACAACAAACC GTTGTGTGAT TTGTTAATTA GGTGTATTAA CTGTCAAAAG 
10 720 

CCACTGTGTC CTGAAGAAAA GCAAAGACAT CTGGACAAAA AGCAAAGATT CCATAATATA 
780 

AGGGGTCGGT GGACCGGTCG ATGTATGTCT TGTTGCAGAT CATCAAGAAC ACGTAGAGAA 
840 

15 ACCCAGCTGA TGCATGGAGA TACACCTACA TTGCATGAAT ATATGTTAGA TTTGCAACCA 
900 

GAGACAACTG ATCTCTACTG TTATGAGCAA TTAAATGACA GCTCAGAGGA GGAGGATGAA 
960 

ATAGATGGTC CAGCTGGACA AGCAGAACCG GACAGAGCCC ATTACAATAT TGTAACCTTT 
20 1020 

TGTTGCAAGT GTGACTCTAC GCTTCGGTTG TGCGTACAAA GCACACACGT AGACATTCGT 
1080 

ACTTTGGAAG ACCTGTTAAT GGGCACACTA GGAATTGTGT GCCCCATCTG TTCTCAGAAA 
1140 

25 CCAACTAGTG GCCACCATCA CCATCACCAT TAA 
1173 

(2) INFOEmATION FOR SEQ ID NO: 14: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 CLYTA E6E7 His HPV16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Giy Gly lie Val His Ser Asp Gly Ser Tyr Pro Lys Asp Lys 
40 1 5 10 15 

Phe Glu Lys lie Asn Gly Thr Trp Tyr Tyr Phe Asp Ser Ser Gly Tyr 

20 25 30 

Met Leu Ala Asp Arg Trp Arg Lys His Thr Asp Giy Asn Trp Tyr Trp 
35 40 45 

45 Phe Asp Asn Ser Gly Glu Met Ala Thr Gly Trp Lys Lys He Ala Asp 
50 55 60 

Lvs Trp Tyr Tyr Phe Asn Glu Glu Gly Ala Met Lys Thr Giy Trp Val 
65 70 75 80 

Lvs Tvr Lvs Asp Thr Trp Tyr Tyr Leu Asp Ala Lys Glu Gly Ala Met 
50 85 90 95 

Val Ser Asn Ala Phe He Gin Ser Ala Asp Gly Thr Gly Trp Tyr Tyr 

100 105 110 

Leu Lys Pro Asp Gly Thr Leu Ala Asp Arg Pro Glu Leu Ala Ser Met 
115 120 125 

55 Leu Asp Met Ala Met Phe Gin Asp Pro Gin Glu Arg Pro Arg Lys Leu 
130 135 140 

Pro Gin Leu Cys Thr Glu Leu Gin Thr Thr He His Asp He He Leu 
145 150 155 160 

Glu Cys Val Tyr Cys Lys Gin Gin Leu Leu Arg Arg Glu Val Tyr Asp 
60 165 170 175 

Phe Ala Phe Arg Asp Leu Cys He Val Tyr Arg Asp Gly Asn Pro Tyr 

180 185 190 

Ala Val Cys Asp Lys Cys Leu Lys Phe Tyr Ser Lys He Ser Glu Tyr 
195 200 205 

65 Arg His Tyr Cys Tyr Ser Leu Tyr Gly Thr Thr Leu Glu Gin Gin Tyr 
210 215 220 
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Asn Lys Pro Leu Cys Asp Leu Leu He Arg Cys He Asn Cys Gin Lys 
225 230 235 240 

Pro Leu Cys Pro Glu Glu Lys Gin Arg His Leu Asp Lys Lys Gin Arg 
245 250 255 

5 Phe His Asn lie Arg Gly Arg Trp Thr Gly Arg Cys Met Ser Cys Cys 
260 265 270 

Arq Ser Ser Arg Thr Arg Arg Glu Thr Gin Leu Met His Gly Asp Thr 

275 280 285 

Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gin Pro Glu Thr Thr Asp 
10 290 295 300 

Leu Tvr Cvs Tyr Glu Gin Leu Asn Asp Ser Ser Glu Glu Glu Asp Glu 
305 310 315 320 

He AsD Gly Pro Ala Gly Gin Ala Glu Pro Asp Arg Ala His Tyr Asn 
325 330 335 

15 He Val Thr Phe Cys Cys Lys Cys Asp Ser Thr Leu Arg Leu Cys Val 
340 345 350 

Gin Ser Thr His Val Asp He Arg Thr Leu Glu Asp Leu Leu Met Gly 

355 360 365 

Thr Leu Gly He Val Cys Pro He Cys Ser Gin Lys Pro Thr Ser Gly 
20 370 375 380 

His His His His His His 
385 390 


25 


(2) INFORMATION FOR SEQ ID NO: 15: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear^ 

Protein D 1/3 E7 his HPV 18 


35 


40 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 

^ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 

^CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 

^CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 

^CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGCATGGACC TAAGGCAACA 

^TTGCAAGACA TTGTATTGCA TTTAGAGCCC CAAAATGAAA TTCCGGTTGA CCTTCTATGT 
420 

CACGAGCAAT TAAGCGACTC AGAGGAAGAA AACGATGAAA TAGATGAAGT TAATCATCAA 

"^CATTTACCAG CCCGACGAGC CGAACCACAA CGTCACACAA TGTTGTGTAT GTGTTGTAAG 

^TGTGAAGCCA GAATTGAGCT AGTAGTAGAA AGCTCAGCAG ACGACCTTCG AGCATTCCAG 

^CAGCTGTTTC TGAACACCCT GTCCTTTGTG TGTCCGTGGT GTGCATCCCA GCAGACTAGT 
660 

GGCCACCATC ACCATCACCA TTAA 
684 


50 


60 


(2) INFORMATION FOR SEQ ID NO: 16: 


Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 228 amino acids 
65 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

Protein D 1/3 E7 his HPV 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

5 

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 

15 10 15 

Ser Asp Lys He lie He Ala His Arg Gly Ala Ser Gly Tyr Leu Pro 
20 25 30 

10 Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
35 40 45 

Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Vai Val 

50 55 60 

He His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
15 65 70 75 80 

Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val lie Asp Phe Thr 

85 90 95 

Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 
100 105 ILO 

20 Ala Met His Gly Pro Lys Ala Thr Leu Gin Asp He Val Leu His Leu 
115 120 125 

Glu Pro Gin Asn Glu He Pro Val Asp Lea Leu Cys His Glu Gin Leu 

130 135 140 

Ser Asp Ser Glu Glu Glu Asn Asp Glu He Asp Glu Val Asn His Gin 
25 145 150 155 160 

His Leu Pro Ala Arg Arg Ala Glu Pro Gin Arg His Thr Met Leu Cys 

165 170 175 

Met Cys Cys Lys Cys Glu Ala Arg He Glu Leu Val Val Glu Ser Ser 
180 185 190 

30 Ala Asp Asp Leu Arg Ala Phe Gin Gin Leu Phe Leu Asn Thr Leu Ser 
195 200 205 

Phe Val Cys Pro Trp Cys Ala Ser Gin Gin Thr Ser Gly His His His 

210 215 220 

His His His 
35 225 

{2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Thioredoxin 

45 


50 


55 


60 



(xi) SEQUENCE 

DESCRIPTION: 

SEQ 

ID 

NO: 17: 





Met 

Ser 

Asp 

Lys 

He 

He His 

Leu 

Thr 

Asp 

Asp 

Ser 

Phe 

Asp 

Thr Asp 

1 


5 




10 





15 


Val 

Leu 

Lys 

Ala 

Asp 

Gly Ala 

He 

Leu 

Val 

Asp 

Phe 

Trp 

Ala 

Glu 

Trp 



20 



25 





30 



Cys 

Gly 

Pro 

Cys 

Lys 

Met He 

Ala 

Pro 

He 

Leu 

Asp 

Glu 

He 

Ala 

Asp 

35 



40 





45 




Glu 

Tyr 

Gin 

Gly 

Lys 

Leu Thr 

Val 

Ala 

Lys 

Leu 

Asn 

He 

Asp 

Gin 

Asn 


50 


55 





60 





Pro 

Gly 

Thr 

Ala 

Pro 

Lys Tyr 

Gly 

He 

Arg 

Gly 

He 

Pro 

Thr 

Leu 

Leu 

65 




70 




75 





80 

Leu 

Phe 

Lys 

Asn 

Gly 

Glu Val 

Ala 

Ala 

Thr 

Lys 

Val 

Gly 

Ala 

Leu 

Ser 




85 




90 





95 


Lys 

Gly 

Gin 

Leu 

Lys 

Glu Phe 

Leu 

Asp 

Ala 

Asn 

Leu 

Ala 





100 




105 









(2) INFORMATION FOR SEQ ID NO: 18: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Protein D 1/3 E7 mutated HPV 18 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
60 

10 ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
15 240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCCA TGCATGGACC TAAGGCAACA 
360 

20 TTGCAAGACA TTGTATTGCA TTTAGAGCCC CAAAATGAAA TTCCGGTTGA CCTTCTAGGT 
420 

CACCAGCAAT TAAGCGACTC AGAGGAAGAA AACGATGAAA TAGATGGAGT TAATCATCAA 

CATTTACCAG CCCGACGAGC CGAACCACAA CGTCACACAA TGTTGTGTAT GTGTTGTAAG 

^TGTGAAGCCA GAATTGAGCT AGTAGTAGAA AGCTCAGCAG ACGACCTTCG AGCATTCCAG 

^CAGCTGTTTC TGAACACCCT GTCCTTTGTG TGTCCG-TGGT GTGCATCCCA GCAGACTAGT 
660 

30 GGCCACCATC ACCATCACCA TTAA 
684 

(2) INFORMATION FOR SEQ ID NO: 19: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 Protein D 1/3 E7 mutated HPV 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

45 Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
15 10 15 

Ser Asp Lys He He He Ala His Arg Giy Ala Ser Gly Tyr Leu Pro 

20 25 30 

Giu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
50 35 40 45 

Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val 

50 55 60 

He His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 70 80 

55 Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val He Asp Phe Thr 
85 50 95 

Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

100 105 110 

Ala Met His Giy Pro Lys Ala Thr Leu Gin Asp He Val Leu His Leu 
60 115 120 125 

Glu Pro Gin Asn Glu He Pro Val Asp Leu Leu Gly His Gin Gin Leu 

130 135 140 

Ser Asp Ser Glu Glu Glu Asn Asp Glu He Asp Gly Val Asn His Gin 
145 150 155 160 

65 His Leu Pro Ala Arg Arg Ala Glu Pro Gin Arg His Thr Met Leu Cys 
165 170 1"^^ 
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Met Cys Cys Lys Cys Giu Ala Arg lie GIm Leu Vai Val Giu Ser Ser 

180 185 190 

Ala Asp Asp Leu Arg Ala Phe Gin Gin Leu Phe Leu Asn Thr Leu Ser 
195 200 205 

5 Phe Val Cys Pro Trp Cys Ala Ser Gin Gin Thr Ser Gly His His His 
210 215 220 

His His His 
225 

10 (2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 37 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Protein D 1/3 E6 - His HPV 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

20 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

25 CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
30 300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCGC GCTTTGAGGA TCCAACACGG 
360 

CGACCCTACA AGCTACCTGA TCTGTGCACG GAACTGAACA CTTCACTGCA AGACATAGAA 
420 

35 ATAACCTGTG TATATTGCAA GACAGTATTG GAACTTACAG AGGTATTTGA ATTTGCATTT 
480 

AAAGATTTAT TTGTGGTGTA TAGAGACAGT ATACCGCATG CTGCATGCCA TAAATGTATA 
540 

GATTTTTATT CTAGAATTAG AGAATTAAGA CATTATTCAG ACTCTGTGTA TGGAGACACA 
40 600 

TTGGAAAAAC TAACTAACAC TGGGTTATAC AATTTATTAA TAAGGTGCCT GCGGTGCCAG 
660 

AAACCGTTGA ATCCAGCAGA AAAACTTAGA CACCTTAATG AAAAACGACG ATTTCACAAC 
720 

45 ATAGCTGGGC ACTATAGAGG CCAGTGCCAT TCGTGCTGCA ACCGAGCACG ACAGGAACGA 
780 

CTCCAACGAC GCAGAGAAAC ACAAGTAACT AGTGGCCACC ATCACCATCA CCATTAA 
937 

50 (2) INFORMATION FOR SEQ ID NO: 21; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 amino acids 

(B) TYPE: amino acid 

55 CC) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Protein D 1/3 E6 - His HPV 18 


60 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 


Met Asp P-o Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 

1 ' 5 10 15 

Ser Asp Lys He He He Ala His Arg Gly Ala Ser Gly Tyr Leu Pro 
20 26 30 

65 Giu His T^r Leu Giu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
35 40 45 
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Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Vai Val 

50 55 60 

lie His AsD His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 ' 70 75 80 

5 Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val lie Asp Phe Thr 
85 90 95 

Leu Lys Glu lie Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

100 105 110 

Ala Arg Phe Glu Asp Pro Thr Arg Arg Pro Tyr Lys Leu Pro Asp Leu 
10 115 120 125 

Cys Thr Glu Leu Asn Thr Ser Lea Gin Asp lie Glu He Thr Cys Vai 

130 135 140 

Tyr Cys Lys Thr Val Leu Glu Leu Thr Glu Val Phe Glu Phe Ala Phe 
145 150 155 160 

15 Lys Asp Leu Phe Val Val Tyr Arg Asp Ser He Pro His Ala Ala Cys 
165 170 175 

His Lys Cys He Asp Phe Tyr Ser Arg lie Arg Glu Leu Arg His Tyr 

180 185 190 

Ser Asp Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr Gly 
20 195 200 205 

Leu Tyr Asn Leu Leu He Arg Cys Leu Arg Cys Gin Lys Pro Leu Asn 

210 215 220 

Pro Ala Glu Lys Leu Arg His Leu Asn Glu Lys Arg Arg Phe His Asn 
225 230 235 240 

25 He Ala Gly His Tyr Arg Gly Gin Cys His Ser Cys Cys Asn Arg Ala 
245 250 255 

Arg Gin Glu Arg Leu Gin Arg Arg Arg Glu Thr Gin Val Thr Ser Gly 

260 265 270 

His His His His His His 
30 275 

(2) INFORMATION FOR SEQ ID NO: 22: 

{i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 Protein Dl/3 E6 E7 His/ HPV 18 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 
60 

45 ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 
120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 
180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 
50 240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 
300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGCGC GCTTTGAGGA TCCAACACGG 
360 

55 CGACCCTACA AGCTACCTGA TCTGTGCACG GAACTGAACA CTTCACTGCA AGACATAGAA 
420 

ATAACCTGTG TATATTGCAA GACAGTATTG GAACTTACAG AGGTATTTGA ATTTGCATTT 
480 

AAAGATTTAT TTGTGGTGTA TAGAGACAGT ATACCGCATG CTGCATGCCA TAAATGTATA 
60 540 

GATTTTTATT CTAGAATTAG AGAATTAAGA CATTATTCAG ACTCTGTGTA TGGAGACACA 
600 

TTGGAAAAAC TAACTAACAC TGGGTTATAC AATTTATTAA TAAGGTGCCT GCGGTGCCAG 
660 

65 AAACCGTTGA ATCCAGCAGA AAAACTTAGA CACCTTAATG AAAAACGACG ATTTCACAAC 
720 
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ATAGCTGGGC 
780 

CTCCAACGAC 
840 

GTATTGCATT 

ACTATAGAGG 

CCAGTGCCAT 

TCGTGCTGCA 

ACCGAGCACG 

ACAGGAACGA 


GCAGAGAAAC 

ACAAGTAATG 

CATGGACCTA 

AGGCAACATT 

GCAAGACATT 

5 

TAGAGCCCCA 

AAATGAAATT 

CCGGTTGACC 

TTCTATGTCA 

CGAGCAATTA 


900 
AGCGACTCAG 

AGGAAGAAAA 

CGATGAAATA 

GATGGAGTTA 

ATCATCAACA 

TTTACCAGCC 


960 
CGACGAGCCG 

AACCACAACG 

TCACACAATG 

TTGTGTATGT 

GTTGTAAGTG 

TGAAGCCAGA 

10 

1020 
ATTGAGCTAG 

TAGTAGAAAG 

CTCAGCAGAC 

GACCTTCGAG 

CATTCCAGCA 

GCTGTTTCTG 


1080 
AACACCCTGT 

CCTTTGTGTG 

TCCGTGGTGT 

GCATCCCAGC 

AGACTAGTGG 

CCACCATCAC 


1140 






15 

CATCACCATT 
1152 

AA 






(2) INFORMATION FOR SEQ ID NO: 23: 

20 (i) SEQUENCE CHAEIACTERISTICS : 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STFANDEDNESS: single 
(Dl TOPOLOGY: linear 

25 Protein Dl/3 E6 E7 His/ HPV 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
30 1 5 10 15 

Ser Asp Lys He He He Ala His Arg Gly Ala Ser Giy Tyr Leu Pro 

20 25 30 

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 
35 40 45 

35 Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val 
50 55 60 

He His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 
65 70 75 80 

Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val He Asp Phe Thr 
40 85 90 95 

Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

100 105 110 

Ala Arg Phe Glu Asp Pro Thr Arg Arg Pro Tyr Lys Leu Pro Asp Leu 
115 120 125 

45 Cys Thr Glu Leu Asn Thr Ser Leu Gin Asp He Glu He Thr Cys Val 
130 135 140 

Tyr Cys Lys Thr Val Leu Glu Leu Thr Glu Val Phe Glu Phe Ala Phe 
145 150 155 160 

Lys Asp Leu Phe Val Val Tyr Arg Asp Ser He Pro His Ala Ala Cys 
50 165 170 175 

His Lys Cys He Asp Phe Tyr Ser Arg He Arg Glu Leu Arg His Tyr 

180 185 190 

Ser Asp Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr Gly 
195 200 205 

55 Leu Tyr Asn Leu Leu He Arg Cys Leu Arg Cys Gin Lys Pro Leu Asn 
210 215 220 

Pro Ala Glu Lys Leu Arg His Leu Asn Glu Lys Arg Arg Phe His Asn 
225 230 235 240 

He Ala Gly His Tyr Arg Gly Gin Cys His Ser Cys Cys Asn Arg Ala 
60 245 250 255 

Arq Gin Glu Arq Leu Gin Arg Arg Arg Glu Thr Gin Val Met Hxs Gly 

260 265 270 

Pro Lys Ala Thr Leu Gin Asp He Val Leu His Leu Glu Pro Gin Asn 
275 280 285 

65 Glu He Pro Val Asp Leu Leu Cvs His Glu Gin Leu Ser Asp Ser Glu 
290 295 300 
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GLu 

Glu 

Asn 

Asp 

Glu 

He 

Asp 

305 





310 


Arg 

Arg 

Ala 

Glu 

Pro 

Gin Arg 





325 



Cys 

Glu 

Ala 

Arg 
340 

He 

Glu 

Leu 

Arg 

Ala 

Phe 
355 

Gin 

Gin 

Leu 

Phe 

Trp 

Cys 
370 

Ala 

Ser 

Gin 

Gin 

Thr 
375 



PCT/EP98/08563 


Gly Val Asn His Gin His Leu Pro Ala 
315 • 320 

His Thr -Met Leu Cys Met Cys Cys Lys 

330 335 
Val Val Glu Ser Ser Ala Asp Asp Leu 

345 350 
Leu Asn Thr Leu Ser Phe Val Cys Pro 
360 365 
Ser Gly His His His His His His 
380 
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